5L-TS 10/5378 97 

JCOS Hec'd PCT/PTO 0 7 J UN 2005 

WO 2004/058980 PCT/EP2003/051104 * 



Plants having modified growth characteristics and a method for 
making the same 

The present Invention concerns a method for modifying plant growth characteristics. More 
specifically, the present invention concerns a method for modifying the growth characteristics 
of a plant by modifying expression of a nucleic acid encoding a zinc finger protein and/or by 
modifying the level and/or activity of a zinc finger protein in a plant, which zinc finger protein 
has two zinc finger domains of the type C2H2 (2xC2H2). The present Invention also concerns 
plants having modified expression of a nucleic acid encoding a 2xC2H2 zinc finger protein 
and/or modified levels and/or activity of a 2xC2H2 zinc finger protein, which plants have 
modified growth characteristics relative to corresponding wild type plants. 

Given the ever-increasing world population, it remains a major goal of agricultural research to 
improve the efficiency of agriculture. Conventional means for crop and horticultural 
improvements utilise selective breeding techniques to identify plants having desirable 
characteristics. However, such selective breeding techniques have several drawbacks, namely 
that these techniques are typically labour intensive and result in plants that often contain 
heterogeneous genetic components that may not always result in the desirable trait being 
passed on from parent plants. Advances in molecular biology have allowed mankind to modify 
the germplasm of animals and plants in a specific and controlled way. Genetic engineering of 
plants entails the isolation and manipulation of genetic material (typically in the form of DNA or 
RNA) and the subsequent introduction of that genetic material into a plant. Such technology 
has led to the development of plants having various improved economic, agronomic or 
horticultural traits. A trait or growth characteristic of particular economic interest is high yield. 
Yield is normally defined as the measurable produce of economic value from a crop. This may 
be defined in terms of quantity and/or quality. Other important growth characteristics include 
modified architecture, modified growth rate, among others. 

The ability to influence one or more of the abovementioned growth characteristics, would have 
many applications in areas such as crop enhancement, plant breeding, production of 
ornamental plants, arboriculture, horticulture, forestry, production, of algae or plants (for 
example for use as bioreactors, for the production of substances such as pharmaceuticals, 
antibodies, or vaccines, or for the bioconversion of organic waste or for use as fuel in the case 
of high-yielding algae and plants). 
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The term "zinc finger" describes a nucleic acid-binding domain in a protein that is folded 
around a tetrahedrally coordinated Zinc ion (Miller et al. 1985. EMBO, 4, 1609-1614). The 
amino acids that coordinate the zinc ion, are always cysteln or histidine residues, however, 
diversity occurs in the sequence and length of the zinc finger domain. Zinc finger proteins may 
5 contain several zinc finger domains of the same or different type. Further variability is 
encountered in nature by association of zinc finger domains with other domains. For example, 
some zinc finger proteins are found in association with ring finger or coil-coil domains, to form 
a so-called tripartite domain. There are several types of zinc fingers, such as C2H2, C2HC, 
C2C2. C2H2 is known as the classical zinc finger domain. There are typically two criteria used 

10 to classify zinc finger proteins, the first being the type of zinc finger and the second being the 
number of zinc fingers present in the protein. Zinc finger proteins having a single C2H2 domain 
have been characterised, for example Superman from Arabidopsis and Ramosa I from maize. 
A well-characterised zinc finger protein having three C2H2 domains is the Indeterminate 1 
protein from Maize. Although the first report of this gene (Colasantl et al., Cell. 1998 May 

15 15;93(4):593-603) only mentions the presence of two zinc finger domains, a more 
sophisticated analysis, using pFAM domain search, revealed the presence of three C2H2zinc 
finger domains. Also known are zino-finger proteins having only two C2H2 domains, for 
example ZAT10 (STZ )and SCOF-1. This subset of plant zinc finger proteins having two C2H2 
domains have been implicated in plant responses to various stresses (Sakamoto et aL, Gene 

20 248 (1-2) 23-32 (2000)). Both STZ and SCOF-1 have been used to enhance abiotic stress 
tolerance. When over-expressed, STZ has been reported to increase salt tolerance in yeast 
(Lippuner et al., J Biol Chem. 271 (22) 12859-12866 (1996)) and over-expression of the 
SCOF-1 gene under control of the CaMV 35 S promoter has been reported to enhance cold 
tolerance in Arabidopsis thaliana (Kim ef a/., Plant J. 25 (3) 247-259 (2001)). Reports of plants 

25 having modified expression of a zinc finger encoding gene (whether the zinc finger gene is 
mutated, over-expressed or otherwise) describe plants having abnormal growth 
characteristics, none of which (with the exception of cold stress tolerance In transgenic plants 
expressing SCOF-1) are desirable for crops or describe effects that are only detectable under 
particular stress conditions. 

30 

It has now been found that modifying expression in a plant of a 2xC2H2 zinc finger gene 
and/or modifying the level and/or activity in a plant of a 2xC2H2 zinc finger protein gives plants 
having modified growth characteristics. In particular it has been found that introduction into a 
plant of a 2xC2H2 zinc finger nucleic acid gives plants modified growth characteristics, such as 
35 increased yield, modified leaf architecture and altered cycle time, each relative to wild type 
plants. 
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Therefore according to one embodiment of the present invention there is provided a method 
for modifying the growth characteristics of a plant, comprising modifying expression in a plant 
of a nucleic acid encoding a 2xC2H2 zinc finger protein and/or modifying level and/or activity in 
a plant of a 2xC2H2 zinc finger protein. 

5 

The term "modifying" as used herein is taken to mean enhancing, decreasing and/or changing 
in place and/or time. Modifying expression of a nucleic acid encoding a 2xC2H2 zinc finger 
protein or modifying the level and/or activity of the 2xC2H2 zinc finger protein itself 
encompasses altered expression of a gene and/or altered level and/or activity of a gene 

10 product, namely a polypeptide, in specific cells or tissues, when compared to expression, level 
and/or activity of a 2xC2H2 zinc finger gene or protein in corresponding wild-type plants. The 
modified gene expression may result from modified expression of an endogenous 2xC2H2 zinc 
finger gene and/or may result from modified expression of a 2xC2H2 zinc finger gene 
previously introduced into a plant. Similarly, modified levels and/or activity of a 2xC2H2 zinc 

15 finger protein may be due to modified expression of an endogenous 2xC2H2 zinc finger 
nucleic acid/gene and/or due to modified expression of a 2xC2H2 zinc finger nucleic acid/gene 
previously introduced into a plant Modified expression of a gene/nucleic acid and/or modified 
level and/or activity of a gene product/protein may be effected, for example, by chemical 
means and/or recombinant means. 

20 

Therefore there is provided by the present invention, a method for modifying the growth 
characteristics of a plant, comprising modifying expression, level and/or activity of a 2xC2H2 
zinc finger gene or protein by recombinant means and/or by chemical means. 

25 Advantageously, modifying expression of a nucleic acid encoding a 2xC2H2 zinc finger protein 
and/or modifying level and/or activity of the 2xC2H2 zinc finger protein itself may be effected 
by chemical means, i.e. by exogenous application of one or more compounds or elements 
capable of modifying activity of the 2xC2H2 zinc finger protein and/or capable of modifying 
expression of a 2xC2H2 zinc finger gene (which may be either an endogenous gene or a 

30 transgene introduced into a plant). The term "exogenous application" as defined herein is 
taken to mean the contacting or administering of a suitable compound or element to a plant 
The compound or element may be exogenously applied to a plant in a form suitable for plant 
uptake (such as through application to the soil for uptake via the roots, or in the case of some 
plants by applying directly to the leaves, for example by spraying). The exogenous application 

35 may take place on wild-type plants or on transgenic plants that have previously been 
transformed with a 2xC2H2 zinc finger nucleic acid/gene or other transgene. 
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Suitable compounds or elements for exogenous application include 2xC2H2 zinc finger 
proteins or 2xC2H2 zinc finger nucleic acids. Alternatively, exogenous application of 
compounds or elements capable of modifying levels of factors that directly or indirectly activate 
or inactivate a 2xC2H2 zinc finger protein will also be suitable in practising the invention. Also 
5 included are antibodies that can recognise or mimic the function of 2xC2H2 zinc finger 
proteins. Such antibodies may comprise "plantibodies", single chain antibodies, IgG antibodies 
and heavy chain camel antibodies, as well as fragments thereof. 

Additionally or alternatively, the resultant effect may also be achieved by the exogenous 
10 application of an interacting protein or activator or an inhibitor of a 2xC2H2 zinc finger 
gene/gene product. Additionally or alternatively, the compound or element may be a mutagenic 
substance, such as a chemical selected from any one or more of. N-nitroso-N-ethylurea, 
ethylene imine, ethyl methanesulphonate and diethyl sulphate. Mutagenesis may also be 
achieved by exposure to ionising radiation, such as X-rays or gamma-rays or ultraviolet light. 
15 Methods for introducing mutations and for testing the effect of mutations (such as by 
. monitoring gene expression and/or protein activity) are well known in the art. 

Additionally or alternatively, and according to a preferred embodiment of the present invention, 
modifying expression of a nucleic acid encoding a 2xC2H2 zinc finger protein and/or modifying 
20 level and/or activity of the 2xC2H2 zinc finger protein may be effected by recombinant means. 
Such recombinant means may comprise a direct and/or indirect approach for modifying 
expression of a nucleic acid and/or level and/or activity of a protein. 

For example, an indirect approach may comprise introducing, into a plant, a nucleic acid 
25 capable of modifying expression of the gene in question (a gene encoding a 2xC2H2 zinc 
finger protein) and or capable of modifying the level and/or activity of the protein in question (a 
2xC2H2 zinc finger protein). Examples of such nucleic acids to be introduced into a plant 
include nucleic acids encoding transcription factors or activators or inhibitors that bind to the 
promoter of a 2xC2H2 zinc finger gene or that interact with a 2xC2H2 zinc finger protein. 
30 Methods to test these types of interactions and methods for isolating nucleic acids encoding 
such interactors include yeast one-hybrid or yeast two-hybrid screens in which the 2xC2H2 
zinc finger gene/protein is used as bait. One example of such a transcription regulator is LOS2, 
described as a transcription regulator for the STZ gene. Therefore, the method of the invention 
may also be performed using LOS2, wherein expression of a 2xC2H2 zinc finger gene may be 
35 increased or further increased by decreasing expression of LOS2 in plants. 
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Also encompassed by an indirect approach for modifying expression of a 2xC2H2 zinc finger 
gene and/or for modifying level and/or activity of a 2xC2H2 zinc finger protein is the provision 
of, or the inhibition or stimulation of regulatory sequences that drive expression of a native 
2xC2H2 zinc finger gene or transgene. Such regulatory sequences may be introduced into a 
5 plant. For example, the regulatory sequence to be introduced into a plant may be a promoter 
capable of driving expression of an endogenous 2xC2H2 zinc finger gene. 

A further indirect approach for modifying expression of a 2xC2H2 zinc finger gene and/or for 
modifying level and/or activity of a 2xC2H2 zinc finger protein In a plant encompasses 

10 modifying levels in a plant of a factor capable of interacting with a zinc finger protein. Such 
factors may include ligands of a 2xC2H2 zinc finger protein. Therefore, the present invention 
also provides a method for modifying growth characteristics of a plant, comprising modifying 
expression of a gene coding for a protein which is a natural ligand of a 2xC2H2 zinc finger 
protein. Furthermore, the present Invention also provides a method for modifying growth 

15 characteristics of a plant, comprising modifying expression of a gene coding for a protein which 
is a natural target/substrate of a 2xC2H2 zinc finger protein. Examples of such 
targets/substrates include stretches of DNA that are bound by the zinc-finger domains. 

A direct and preferred approach on the other hand comprises introducing into a plant a nucleic 
20 acid encoding a 2xC2H2 zinc finger protein or a portion thereof or sequences capable of 
hybridising therewith, which nucleic acid preferably encodes a 2xC2H2 zinc finger protein or a 
homologue, derivative or active fragment thereof. The nucleic acid may be introduced into a 
plant by, for example, transformation. 

25 Therefore, there is provided a method for modifying growth characteristics of a plant, 
comprising introducing Into a plant a 2xC2H2 zinc finger nucleic acid or a portion thereof. 

The 2xC2H2 zinc finger nucleic acid may be derived (either directly or indirectly Qf 
subsequently modified)) from any source provided that the sequence, when expressed in a 

30 plant, leads to modified expression of a 2xC2H2 zinc finger-encoding nucleic acid/gene and/or 
modified level and/or activity of a 2xC2H2 zinc finger protein. The 2xC2H2 zinc finger gene or 
protein may be wild type, i.e. the native or endogenous nucleic acid or polypeptide. 
Alternatively, it may be a protein or nucleic acid derived from the same or another species. The 
nucleic acid/gene may then be introduced into a plant as a transgene, for example by 

35 transformation. 
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The nucleic acid may be isolated from a bacteria, yeast or fungi, or from a plant, algae, insect 
or animal (including human) source. This nucleic acid may be substantially modified from its 
native form In composition and/or genomic environment through deliberate human 
manipulation. The nucleic acid is preferably obtained from a plant, whether from the same 
5 plant species in which it is to be introduced or whether from a different plant species. Further 
preferably, the nucleic acid is from a dicot, preferably from the family Brassicaceae, further 
preferably from Arabidopsls thaliana. More preferably, the nucleic acid is essentially similar to 
a nucleic acid as represented by SEQ ID NO 1, or a portion of SEQ ID NO 1, or a nucleic acid 
capable of hybridising therewith or is a nucleic acid encoding an amino acid sequence 
10 essentially similar to an amino acid as represented by SEQ ID NO 2, or a homologue, 
derivative or active fragment thereof. 

Advantageously, the methods according to the invention may also be practised using variant 
2xC2H2 zinc finger nucleic acids and variant 2xC2H2 zinc finger amino acids, preferably 
15 wherein the variant nucleic acids are variants of SEQ ID NO 1 and wherein the variant amino 
acids are variants of SEQ ID NO 2. Examples of variant sequences suitable in performing the 
methods of the invention include: 

(i) Functional portions of a 2xC2H2 zinc finger nucleic acid/gene; 

(ii) Sequences capable of hybridising with a 2xC2H2 zinc finger nucleic acid/gene; 
20 (iii) Alternative splice variants of a 2xC2H2 zinc finger nucleic acid/gene; 

(iv) Allelic variants of a 2xC2H2 zinc finger nucleic acid/gene; 

(v) Homologues, derivatives and active fragments of a 2xC2H2 zinc finger protein. 

The abovementioned variants may also be described as being "essentially similar" to a 
25 2xC2H2 zinc finger nucleic acid/gene, particularly to the 2xC2H2 zinc finger encoding nucleic 
acid of SED ID NO 1, or essentially similar to a 2xC2H2 zinc finger amino acid/protein, 
particularly that of SED ID NO 2. The term "essentially similar to" also includes variants of 
SEQ ID NO 1 in the form of a complement, DNA, RNA, cDNA or genomic DNA. The variant 
nucleic acid encoding a 2xC2H2 zinc finger protein or the variant of a 2xC2H2 zinc finger 
30 protein may be synthesized in whole or in part, it may be a double-stranded nucleic acid or a 
single-stranded nucleic acid. Also, the term encompasses a variant due to the degeneracy of 
the genetic code; a family member of the gene or protein; and variants that are interrupted by 
one or more intervening sequences. 

35 An example of a variant 2xC2H2 zinc finger nucleic acid is a functional portion of a 2xC2H2 
zinc-finger gene. Advantageously, the method according to the present invention may also be 
practised using portions of a DNA or nucleic acid encoding a 2xC2H2 zinc finger protein. A 
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functional portion refers to a piece of DNA derived or prepared from an original (larger) DNA 
molecule, which DNA portion, when expressed in a plant, gives plants having modified growth 
characteristics. The portion may comprise many genes, with or without additional control 
elements or may contain spacer sequences. The portion may be made by making one or 
5 more deletions and/or truncations to the nucleic acid. Techniques for introducing truncations 
and deletions into a nucleic acid are well known in the art. Portions suitable for use in the 
methods according to the invention may readily be determined by following the methods 
described in the Examples section by simply substituting the sequence used in the actual 
Example with the portion to be tested for functionality. 

10 

An example of a further variant 2xC2H2 zinc finger nucleic acid is a sequence that is capable 
of hybridising to a 2xC2H2 zinc finger nucleic acid, for example to any of SEQ ID NO 1, 10, 12, 
14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 41, 43, 45, 47 or 49. Advantageously, the 
methods according to the present invention may also be practised using these variants. 
15 Hybridising sequences suitable for use in the methods according to the invention may readily 
be determined for example by following the methods described in the Examples section by 
simply substituting the sequence used in the actual Example with the hybridising sequence. 

The term "hybridisation" as defined herein is a process wherein substantially homologous 

20 complementary nucleotide sequences anneal to each other. The hybridisation process can 
occur entirely in solution, i.e. both complementary nucleic acids are in solution. Tools in 
molecular biology relying on such a process include the polymerase chain reaction (PCR; and 
all methods based thereon), subtractive hybridisation, random primer extension, nuclease S1 
mapping, primer extension, reverse transcription, cDNA synthesis, differential display of RNAs, 

25 and DNA sequence determination. The hybridisation process can also occur with one of the 
complementary nucleic acids immobilised to a matrix such as magnetic beads, Sepharose 
beads or any other resin. Tools in molecular biology relying on such a process include the 
isolation of poly (A+) mRNA. The hybridisation process can furthermore occur with one of the 
complementary nucleic acids immobilised to a solid support such as a nitro-cellulose or nylon 

30 membrane or immobilised by e.g. photolithography to, for example, a siliceous glass support 
(the latter known as nucleic acid arrays or microarrays or as nucleic acid chips). Tools in 
molecular biology relying on such a process include RNA and DNA gel blot analysis, colony 
hybridisation, plaque hybridisation, in situ hybridisation and microarray hybridisation. In order 
to allow hybridisation to occur, the nucleic acid molecules are generally thermally or chemically 

35 denatured to melt a double strand into two single strands and/or to remove hairpins or other 
secondary structures from single stranded nucleic acids. The stringency of hybridisation is 
influenced by conditions such as temperature, salt concentration and hybridisation buffer 
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composition. High stringency conditions for hybridisation include high temperature and/or low 
salt concentration (salts include NaCI and Na 3 -citrate) and/or the inclusion of formamide in the 
hybridisation buffer and/or lowering the concentration of compounds such as SDS (detergent) 
in the hybridisation buffer and/or exclusion of compounds such as dextran sulphate or 
5 polyethylene glycol (promoting molecular crowding) from the hybridisation buffer. Conventional 
hybridisation conditions are described in, for example, Sambrook (2001) Molecular Cloning: a 
laboratory manual, 3rd Edition Cold Spring Harbor Laboratory Press, CSH, New York, but the 
skilled craftsman will appreciate that numerous different hybridisation conditions may be 
designed in function of the known or the expected homology and/or length of the nucleic acid 

10 sequence. Sufficiently low stringency hybridisation conditions are particularly preferred (at 
least in the first instance) to isolate nucleic acids heterologous to the DNA sequences of the 
invention defined supra. An example of low stringency conditions is 4-6x SSC / 0.1-0.5% w/v 
SDS at 37-45°C for 2-3 hours. Depending on the source and concentration of the nucleic acid 
involved in the hybridisation, alternative conditions of stringency may be employed, such as 

15 medium stringency conditions. Examples of medium stringency conditions include 1-4x SSC / 
0.25% w/v SDS at a 45°C for 2-3 hours. An example of high stringency conditions includes 0.1 
to 2x SSC / 0.1% w/v SDS at 60°C for 1-3 hours. The skilled man will be aware of various 
parameters which may be altered during hybridisation and washing and which will either 
' maintain or change the stringency conditions. The stringency conditions may start low and be 

20 progressively increased until there is provided a hybridising nucleic acid, as defined 
hereinabove. Elements contributing to heterology include allelism, degeneration of the genetic 
code and differences in preferred codon usage. 

Another variant 2xC2H2 zinc finger nucleic acid useful in practising the methods according to 
25 the present invention is an alternative splice variant of a nucleic acid sequence encoding a 
2xC2H2 zinc finger protein. The term "alternative splice variant" as used herein encompasses 
variants of a nucleic acid sequence in which selected introns and/or exons have been excised, 
replaced or added. Such splice variants may be found in nature or may be manmade. Methods 
for making such splice variants are well known in the art. Splice variants suitable for use in the 
30 methods according to the invention may readily be determined for example by following the 
methods described in the Examples section by simply substituting the sequence used in the 
actual Example with the splice variant. 

Another variant 2xC2H2 zinc finger nucleic acid useful in practising the methods according to 
35 the present invention is an allelic variant of a nucleic acid encoding a 2xC2H2 zinc finger 
protein. Allelic variants exist in nature and encompassed within the methods of the present 
invention is the use of these natural alleles. Allelic variants also encompass Single Nucleotide 
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Polymorphisms (SNPs), as well as Small Insertion/Deletion Polymorphisms (INDELs). The size 
of INDELs is usually less than 100 bp. SNPs and INDELs form the largest set of sequence 
variants in naturally occurring polymorphic strains of most organisms. Allelic variants suitable 
for use in the methods according to the invention may readily be determined for example by 
5 following the methods described in the Examples section by simply substituting the sequence 
used In the actual Example with the allelic variant. 

The present invention provides a method for modifying plant growth characteristics, comprising 
modifying expression in a plant of an alternative splice variant or expression in a plant of an 
1 0 allelic variant of a nucleic acid encoding a 2xC2H2 zinc finger protein and/or by modifying level 
and/or activity in a plant of a 2xC2H2 zinc finger protein encoded by the alternative splice 
variant or allelic variant 

Examples of variant 2xC2H2 zinc finger proteins useful in practicing the methods of the 
15 present invention are homologues, derivatives or functional fragments of a 2xC2H2 zinc finger 
protein. 

"Homologues" of a "2xC2H2 zinc finger protein encompass peptides, oligopeptides, 
polypeptides, proteins and enzymes having amino acid substitutions, deletions and/or 

20 insertions relative to the unmodified protein in question and having similar biological and 
functional activity as the unmodified protein from which they are derived. To produce such 
homologues, amino acids of the protein may be replaced by other amino acids having similar 
properties (such as similar hydrophobicity, hydrophilicity, antigenicity, propensity to form or 
break o-helical structures or p-sheet structures). Conservative substitution tables are well 

25 known in the art (see for example Creighton (1984) Proteins. W.H. Freeman and Company). 
The homologues useful in the method according to the invention have at least in increasing 
order of preference 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, 41%, 42%, 
43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 52%, 54%, 56%, 58%, 60%, 62%, 64%, 66%, 
68%, 70%, 72%, 74%, 76%, 78%, 80%, 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98% 

30 sequence identity or similarity to an unmodified protein. 

The percentage of identity may be calculated by using an alignment program well known in the 
art. For example, the percentage of identity may be calculated using the program GAP, or 
needle (EMBOSS package) or stretcher (EMBOSS package) or the program align X, as a 
35 module of the vector NTI suite 5.5 software package, using the standard parameters (for 
example GAP penalty 5, GAP opening penalty 15, GAP extension penalty 6.6). 
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According to another embodiment of the present invention, the nucleic acid sequence useful in 
the methods of the present invention is a nucleic acid encoding a protein homologous to SEQ 
ID NO 2. 

Methods for the search and identification of 2xC2H2 zinc finger protein homologues, for 
5 example STZ zinc finger homologues, would be well within the realm of a person skilled in the 
art. Such methods, involve screening sequence databases with the sequences provided by the 
present invention, for example SEQ ID NO 2 (or SEQ ID NO 1), preferably in a computer 
readable format. This sequence information may be available in public databases, that include 
but are not limited to Genbank (htto://www.ncbl.nlm.nlh.QOv/web/Genbank). the European 

10 Molecular Biology Laboratory Nucleic acid Database (EMBL) (http:Aw.ebi.ac.uk/ebi-docs/embl- 
db.html) or versions thereof or the MIPS database (http://mips.gsf.de/). Different search 
algorithms and software for the alignment and comparison of sequences are well known in the 
art. Such methods include GAP, BESTFIT, BLAST, FASTA and TFASTA. GAP uses the 
algorithm of Needleman and Wunsch (J. Mol. Biol. 48: 443-453, 1970) to find the alignment of 

15 two complete sequences that maximises the number of matches and minimises the number of 
gaps. The BLAST algorithm calculates percent sequence identity and performs a statistical 
analysis of the similarity between the two sequences. The suite of programs referred to as 
BLAST programs has 5 different implementations: three designed for nucleotide sequence 
queries (BLASTN, BLASTX, and TBLASTX) and two designed for protein sequence queries 

20 . (BLASTP and TBLASTN) (Coulson, Trends in Biotechnology: 76-80, 1994; Birren et al., 
GenomeAnalysis, 1: 543, 1997). The software for performing BLAST analysis is publicly 
available through the National Centre for Biotechnology Information. 

Default blast parameters to find useful homologues of any of SEQ ID NO 1, SEQ ID NO 2 or 
25 any of SEQ ID NO 10 to SEQ ID NO 50, are, when comparing nucleotide sequence G (Cost to 
open a gap) 5, E (Cost to extend a gap default) 2, q (Penalty for a mismatch) -3, r (Reward for 
a match) 1, e (Expectation value (E)) 10.0, W (Word size) 11, V (Number of one-line 
descriptions) 100 and B (Number of alignments to show) 100. When comparing protein 
sequences, the default parameters are preferably G 11, E 1, e value 10.0, W 3. V 100 and B 
30 100. 

The above-mentioned analyses for comparing sequences, for the calculation of sequence 
identity and for the search for homologues, is preferentially done with full-length sequences or 
within a conserved region of the sequence. Therefore, these analyses may be based on a 
35 comparison of certain regions such as conserved domains, motifs or boxes. 
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The identification of such domains or motifs for examples the motif and boxes as represented 
by SEQ ID NO 5, 6, 7, 8 and 9, would also be well within the realm of a person skilled in the art 
and involves for example, a computer readable format of proteins of the present invention, the 
use of alignment software programs and the use of publicly available information on protein 
5 domains, conserved motifs and boxes. This protein domain information is available in the 
PRODOM (http://www.biochem.ucl.ac.uk/bsm/dbbrowse^ PIR 
(http://pir.georgetown.edu/) or pFAM (http://pfam.wustl.edu/) database. For the identification of 
Zinc finger domains, such as the 2xC2H2 zinc finger domain, pFAM is preferred. Sequence 
analysis programs designed for motif searching may be used for Identification of fragments, 

10 regions and conserved domains as mentioned above. Preferred computer programs would 
include but are not limited to MEME, SIGNALSCAN, and GEIMESCAN. A MEME algorithm 
(Version 3.0) may be found in the GCG package; or on the Internet site 
http://www.sdsc.edu/lvlElvlE/meme. SIGNALSCAN version 4.0 information is available on the 
Internet site http^/blosci.cbs.umn.edu/software/sigscan.html. GENESCAN may be found on 

1 5 the Internet site http://gnomic.stanford.edu/GENESCANW.html. 

At present, zinc finger motifs are subdivided in more than 40 different classes as can be found 
In the Pfam database of protein families present at the Sanger institute 
(http://www.sanger.ac.Uk/Software/Pfam/browse/Z.shtml). 

20 

The C2H2 zinc finger (Zf-C2H2) motif is the classical zinc finger domain. It was first recognized 
in the transcription factor IMA (TFIIIA) of Xenopus (Miller at al. 1985). The domain is typically 
25 to 30 amino-acid residues in length. The following pattern describes the zinc finger *X-C- 
X(1-5)-C-X3-*X5-*X2-H-X(3-6)-[H/C] where X can be any amino acid, and numbers in brackets 

25 indicate the number of residues. The positions marked * are those that are important for the 
stable folding of the zinc finger. The final position can be either his or cys, while still being a 
C2H2 zinc finger domain. In view of recent publications on the design of zinc finger domains it 
becomes feasible also to replace one or more of the Cys or His amino acids, whilst still 
retaining the original functionality of the C2H2 domain. The residues separating the second 

30 Cys and the first His are mainly polar and basic. The canonical C2H2 zinc finger is composed 
of two short beta strands followed by an alpha helix. DNA binding of the zinc finger motif is 
mediated by amino terminal part of the alpha helix which binds the major groove in DNA 
binding zinc fingers. C2H2 domains have been shown to interact with RNA, DNA and proteins. 
The tetracoordination of a Zinc ion by the conserved cystein and histidine residues determines 

35 the conserved tertiary structure of the motif. Conserved hydrophobic residues are commonly 
found at positions -2 and also at 4 amino acids after the second cystein (that participates in 
zinc binding) and at position three before the first histidine (that participates in zinc binding). In 



WO 2004/058980 



PCT7EP2003/051104 



plant multi zinc finger proteins, spacing between the C2H2 domains is generally about 15 to 
about 65 amino acids. 

Thus, plant zinc finger proteins are characterized by long spacers of diverse lengths between 
5 adjacent fingers. Moreover, they are characterised by a highly conserved sequence of six 
amino acids, located within a putative DNA-contacting surface of each finger. Two forms of 
such conserved sequence are most commonly found in plant C2H2 zinc fingers, the QALGGH 
(SEQ ID NO 5) and the NNM/WQMH (SEQ ID NO 6). Despite the high sequence conservation 
of the QALGGH, some variants or the so-called 'modified type' occur in nature where one or 

10 two amino acids can have a different form, most typically the +1 "Q" can be a "G"," K" or "R" 
(these amino acids share the same turn-like characteristic), the +2 "A" can be "S" (both of 
which share the characteristic of being small amino acids) or the +3" "L can be T" (these two 
amino acids are both hydrophobic). The QALGGH-motif as used herein comprises all these 
variants. In the NNM/WQMH motif at position 3 there is mostly an "M" or a "W. 

15 Therefore, the present invention provides a method as described hereinabove, wherein said 
2xC2H2 zinc finger protein comprises a QALGGH motif. Further, The present invention 
provides a s described hereinabove, wherein said 2xC2H2 zinc finger protein comprises a 
NNM/WQMH motif. 

20 According to one embodiment of the invention, both C2H2 domains are of the same type. More 
preferably, both C2H2 zinc finger domains have the same conserved GALGGH or 
NNM/WQMH motif. According to another embodiment, each C2H2 zinc finger domain has a 
different conserved motif. 

25 According to one embodiment, the 2xC2H2 protein useful in the methods of the present 
invention is characterized by an EAR motif, which is an ERF-Associated amphiphilic 
repression motif. This motif has been recognized in two unrelated types of transcription factors, 
namely the ERF transcription factors of the AP2 type and in the zinc finger transcription 
factors. In the latter class, the EAR motif is generally located at the C-terminus of the protein. 

30 The pattern for the EAR motif has the conserved sequence hDLNh(X)P (SEQ ID NO 7), where 
°h" is a hydrophobic residue (any one of A,C,F,G,H,I,K,L,M,R,T,V,W,Y) most typically L/F/l and 
where "X" can be one (any amino acid) or no amino acid. A characteristic feature of the EAR 
motif is the alternation of hydrophilic and hydrophobic residues with the aspartic acid (D) 
residue being amphiphilic. Ohta et al. (The plant cell, 2001, 13, p1 959-1 968), which reference 

35 is cited herein by reference, previously characterized EAR motifs present in 2xC2H2 zinc 
finger proteins. 
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Therefore, the present invention provides a method as described hereinabove, wherein the 
2xC2H2 zinc finger protein comprises an EAR motif. According to one embodiment, the EAR 
motif is located in the C-terminal region of the protein, preferably between the second zinc 
finger domain and the C-terminus. 

5 

According to a further embodiment, the zinc finger proteins used in the methods of the present 
invention have two zinc finger domains and a nuclear localization signal (B-box). A cluster of 
basic amino acids that resembles the B-box (Basic box) were described by Chua et al. (EMBO 
1992- 11, 241-9) and were hypothesized to be a nuclear localization signal for the protein. 

10 These have been recognized in 2xC2H2 proteins (Sakamoto et al., Gene 248 (2000) 23-32). 
The cluster is rich in Lysine (K) and Arginine (R) residues. A consensus sequence defining the 
most frequent form of the B-box in 2xC2H2 genes is KR(S)KRXR (SEQ ID NO 8) where "S" at 
the 3rd position may be absent or present However other variants may occur in nature that 
still retain the characteristic of being a charged region rich in basic amino acids. The location of 

15 the basic box is most frequently at the N-terminus of the protein, but can also occur in other 
locations. It has been speculated that due to its basic nature the B-box could also participate in 
DNA binding. 

Accordingly, the present Invention provides a method as described hereinabove, wherein the 
20 2xC2H2 zinc finger protein further comprises a B-box. According to one embodiment the B-box 
is located in the N-terminal region of the zinc finger protein. Preferably the proteins useful in 
the methods of the present invention have a B-box located between the N-terminus and the 
first zinc finger domain. 

25 According to a further embodiment, the zinc finger proteins useful in the methods of the 
present Invention have two C2H2 zinc finger domains and an L-box. A conserved motif, named 
L-box, of yet unknown function has been identified in 2xC2H2 proteins and has been described 
previously by Sakamoto et al. (Gene 248 (2000) 23-32). The L-box is typically located at the N- 
terminus, between the B-box and the first C2H2 zinc finger. The L-box is represented by the 

30 sequence EXEXXAXCLXXL (SEQ ID NO 9). This region may be involved in protein-protein 
interactions. Zinc finger proteins lacking the L-box, may for example have serine rich regions at 
a similar position, which regions are putative sites for protein -protein interactions. 

Therefore, the present invention provides a method as described hereinabove, wherein the 
35 2xC2H2 protein comprises an L-box. 
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Particular zinc finger homologues useful in the methods of the present invention have one or 
more of the conserved motifs as depicted in SEQ ID NO 5, 6, 7, 8 and 9, or motifs that are 
80% identical to these motifs or motifs that have conserved substitutions of amino acids. The 
2xC2H2 protein as set forth in SEQ ID NO 2 comprises all the boxes as set forth in SEQ ID NO 
5 5, 7, 8 and 9. All its paralogues and orthologues also comprise all of these boxes. 

Homologues of a 2xC2H2 protein as presented in SEQ ID NO 2 and isolated from Arabidopsls 
thaliana, that are useful in the constructs and the methods of the present invention are also 
identified in other plant species. 

10 

Two special forms of homologue, orthologues and paralogues, are evolutionary concepts used 
to describe ancestral relationships of genes. The term "paralogue" relates to a gene- 
duplication within the genome of a species leading to paralogous genes. The term "orthologue" 
relates to a homologous gene in different organisms due to ancestral relationship. The term 
15 "homologue" as used herein also encompasses paralogues and orthologues of the proteins 
useful in the methods according to the invention. 

Othologues in other plant species may easily be found by performing a so-called reciprocal 
blast search. Orthologous genes can be identified by querying one or more gene databases 

20 with a query gene or protein of interest (SEQ ID NO 1 or 2), using for example BLAST 
program. The highest-ranking subject genes that result from the search are then again 
subjected to a BLAST analysis, and only those subject genes that match again with the query 
sequence (SEQ ID NO 1 or 2) are retained as true orthologous genes. For example, to find a 
rice orthologue of an Arabidopsls thaliana gene, one may perform a BLASTN or TBLASTX 

25 analysis on a rice database such as (but not limited to) the Oryza sativa Nipponbare database 
available at the NCBI website (http7AAAww.ncbi.nlm.nih.gov) or the genomic sequences of rice 
(cultivars indica or japonica). In a next step, the obtained rice sequences are used in a reverse 
BLAST analysis using an Arabidopsis database. The results may be further refined when the 
resulting sequences are analysed with ClustafW and visualised in a neighbour joining tree. 

30 The method can be used to identify orthologues from many different species. 

The closest homologues in other species (orthologues of the protein of SEQ ID NO 2), include 
those from a variety of dicot and monocot plants, for example from Datisca glomerata 
(AF119050_1, AAD26942, SEQ ID NO 10 and 11), from soybean (TO9602, SCOF-1, SEQ ID 
35 NO 12 and 13), Medicago sativa (CAB77055.1, SEQ ID NO 14 and 15), from tobacco 
(T01985, SEQ ID NO 16 and 17) from rice, (AF332876_1, AAK01713.1, SEQ ID NO 18 and 
19), from petunia (BAA05079.1, SEQ ID NO 20 and 21), from wheat (S39045 and BAA03901, 
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WZF1, SEQ ID NO 22 and 23), from Capsicum annum (SEQ ID NO 24 and 25), from turnip 
(T14408, T14409) and from sugarcane (CA279020). 

Close homologues of the same species (paralogues of the protein of SEQ ID NO 2 from 
5 Arabidopsis thaliana) are described below. 

The MIPS database contains the sequence of the Arabidopsis thaliana genome with prediction 
and functional annotation of the proteins encoded. Searching this database with the STZ gene 
of SEQ ID NO 1 (MIPS accession number At1g27730), showed that in the Arabidopsis 
genome there are 2 genes encoding very close homologues of SEQ ID NO 2, At5g43170 

10 (NM_123683, SEQ ID NO 32 and 33) and At5g04340 (NM_1 20516 SEQ ID NO 28 and 29), 
and 3 others with high similarity: At3g19580 (NM_1 12848, SEQ ID NO 26 and 27), At5g67450 
(NMJ26145, SEQ ID NO 34 and 35) and At3g49930 (NM_1 14853. SEQ ID NO 30-31). These 
genes are spread over 3 chromosomes, 1, 3 and 5. Similarly, a number of paralogues of the 
orthologue in Petunia have been isolated and sequenced. Advantageously, paralogues from 

15 the same species may be used in the methods of the present invention. 

Furthermore, a number of family members of the STZ protein of SEQ ID NO 2 have been 
found in Ambisopsis. The STZ gene and protein of SEQ ID NO 1 and 2 have been previously 
published in the database under the MIPS accession number At1g27730 or in Genbank under 

20 the accession numbers NP_174094.1, X95573 or CAA64820. Additionally, several other 
cDNA's, isolated from other tissues or at different developmental stages of Arabisopsis have 
been reported and encode the same protein as that of SEQ ID NO 2. S uch sequences 
sequences deposited under the Genbank accession number AY034998, NM_1 02538, 
AC12375, X95573, AY063006. X98671, X98670, or AF250336. These isolates illustrate the 

25 differential expression of the STZ gene In different plant tissues at different developmental 
stages. The differential regulation of these different cDNA's is reflected by the differences at 
their 5'UTR and the 3'UTR regions, while the encoded protein remains the same. 
Advantageously, the members of the same gene family as SEQ ID NO 1 or the members of 
the same family of any of the orthologues of SEQ ID NO 1 , may be used in the methods of the 

30 present invention. 

Other close homologues useful in the methods of the present invention are the sequences as 
deposited in the public database under the following accession numbers, which sequences are 
herein incorporated by reference: homologues isolated from Petunia: BAA21923.1, 
35 BAA21922.1, BAA21926.1, BAA21925.1. BAA19110.1, BAA19926.1, BAA21924.1, 
BAA19111.1, BAA21921.1, BAA19114.1, BAA05076.1, BAA05079.1, CAA43111.1, 
BAA21920.1, BAA21919.1, BAA05077.1, BAA05078.1, BAA20137.1; homologues isolated 

15 
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from Arabidopsis: CAA67229.1, BAC43454.1, NP_196054.1, AAM67193.1, NP.199131.1. 
NPJ88592.1, NP_201546.1, NP_190562.1, NP_182037.1, BAC43008.1, Q8VWG3, 
CAC86393.1, CAC86168.1, CAC86167.1, CAC86166.1, CAB67667.1, CAC01747.1, 
CAB90936.1, CAB90935.1, CAB80245.1, CAB41188.1, CAA18741.1, CAA67234.1, 
5 CAA67236.1, CAA67231.1, CAA67230.1, CAA67228.1, CAA67235.1, CAA67233.1, 
CAA67232.1, CAA67229.1, CAA64820.1 and homologues isolated finorii rice: BAB16855.1, 
AAO06972.1, CAC09475.1, BAB63718.1, P0683F02.21, BAB67885.1, P0031D11.19, 
BAB64114.1, AAK01713.1, AF332876_1, AAL76091.1, BAB67879.1, P0031D11.12 and 
BAC15513.1. 

10 

A phylogenetic tree may be constructed with ail the homologues, paralogues and orthologues 
are defined herein above. Multiple alignment q may be made using clustal W present in the 
VNTi (version 5.0) program with for example Gap opening penalty 10 and Gap extention 5. For 
making a phylogenetic tree the Phyiic software package available at 
15 htto://evolution.Qenetics.was hinoton.edu/ohvfip.html may be used. Sequences clustering 
around SEQ ID NO 1 or SEQ ID NO 2, identify genes or proteins suitable for use in the 
methods of the present invention. 

The sequence of SEQ ID NO 2 and its rice orthologue AF332876 (SEQ ID NO 19) have 36% 
20 sequence identity when using the program Needle with the parameters Gap penalty 5 and Gap 
extension penalty 6. Therefore, homologues particularly useful in the methods of the present 
invention are homologues having 36% or more sequence identity with the 2xC2H2 zinc finger 
protein as presented in SEQ ID NO 2 or having 36% or more sequence identity to the closest 
orthologue of SEQ ID NO 2 from another species. 

25 

Preferred homologues useful in practicing the methods of the present invention are plant 
homologues, i.e. proteins obtained from a plant nucleic acid. More preferably, the nucleic acid 
sequence is from a dicot, more preferably from the family Brassicaceae, further preferably from 
Arabidopsis thaliana. 

30 

Preferably the 2xC2H2 zinc finger protein useful in the methods of the present invention 
belongs to the same gene family as the salt tolerant zinc finger protein (STZ) of Arabidopsis 
thaliana, or is a homologues thereof. The name ZAT10 can also be used to identify the STZ 
zinc finger protein of Arabidopsis thaliana. 

35 

Another variant of a zinc finger protein useful in the methods of the present invention is a 
substitutional variant The term "Substitutional variants" of a protein refers to those variants in 
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which at least one residue in an amino acid sequence has been removed and a different 
residue inserted in its place. Amino acid substitutions are typically of single residues, but may 
be clustered depending upon functional constraints placed upon the polypeptide; insertions will 
usually be of the order of about 1-10 amino acid residues, and deletions will range from about 
5 1-20 residues. Preferably, amino acid substitutions comprise conservative amino acid 
substitutions. Particular substitutional variants of the C2H2 zinc finger protein are substitutional 
variants in which one or more of the conserved Cys and/or His residues is replaced, whilst 
retaining the same zinc finger functionality. To retain the same functionality, the residues 
around these conserved Cys of His residues may also be substituted. 

10 

"lnsertional variants" of a protein are those in which one or more amino acid residues are 
introduced into a predetermined site in said protein. Insertions can comprise amino-terminal 
and/or carboxy-terminal fusions as well as intra-sequence insertions of single or multiple amino 
acids. Generally, insertions within the amino acid sequence will be smaller than amino- or 

15 carboxy-terminal fusions, of the order of about 1 to 10 residues. Examples of amino- or 
carboxy-terminal fusion proteins or peptides include the binding domain or activation domain of 
a transcriptional activator as used in the yeast two-hybrid system, phage coat proteins, 
(histidine)e-tag, glutathione S-transferase-tag, protein A, maltose-binding protein, dihydrofolate 
reductase, Tag«100 epitope, c-myc epitope, FLAG®-epitope, lacZ, CMP (calmodulin-binding 

20 peptide), HA epitope, protein C epitope and VSV epitope. 

"Deletion variants" of a protein are characterised by the removal of one or more amino acids 
from the protein. Amino acid variants of a protein may readily be made using peptide synthetic 
techniques well known in the art, such as solid phase peptide synthesis and the like, or by 

25 recombinant DNA manipulations. Methods for the manipulation of DNA sequences to produce 
substitution, insertion or deletion variants of a protein are well known in the art. For example, 
techniques for making substitution mutations at predetermined sites in DNA are well known to 
those skilled In the art and include M13 mutagenesis, T7-Gen in vitro mutagenesis (USB, 
Cleveland, OH), QuIckChange Site Directed mutagenesis (Stratagene, San Diego, CA), PCR- 

30 mediated site-directed mutagenesis or other site-directed mutagenesis protocols. 

The term "derivatives" refers to peptides, oligopeptides, polypeptides, proteins and enzymes 
which may comprise substitutions, deletions or additions of naturally and non-naturally 
occurring amino acid residues compared to the amino acid sequence of a naturally-occurring 
35 form of the 2xC2H2 protein such as for example the 2xC2H2 zinc finger protein as presented 
in SEQ ID NO 2. "Derivatives" of a 2xC2H2 zinc finger protein encompass peptides, 
oligopeptides, polypeptides, proteins and enzymes which may comprise naturally occurring 
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altered, glycosylated, acylated or non-naturally occurring amino acid residues compared to the 
amino acid sequence of a naturally-occurring form of the polypeptide. A derivative may also 
comprise one or more non-amino acid substituents compared to the amino acid sequence from 
which it is derived, for example a reporter molecule or other ligand, covalently or non- 
5 covalently bound to the amino acid sequence such as, for example, a reporter molecule which 
is bound to facilitate its detection, and non-naturally occurring amino acid residues relative to 
the amino acid sequence of a naturally-occurring protein. 

Another variant of a 2xC2H2 zinc finger protein useful in the methods of the present invention 
10 is an active fragment of a zinc finger protein. 'Active fragments" of a 2xC2H2 zinc finger 
protein encompasses at least five contiguous amino acid residues of a protein, which residues 
retain similar biological and/or functional activity to the naturally occurring protein. For 
example, useful fragments comprise at least 10 contiguous amino acid residues of a 2xC2H2 
zinc finger protein. Other preferred fragments are fragments of a 2xC2H2 zinc finger protein 
1 5 starting at the second or third or further internal methionin residues. These fragments originate 
from protein translation, starting at internal ATG codons. Functional fragments of a 2xC2H2 
zinc finger protein useful in practising the methods of the present invention may have one, two 
• or no C2H2 domains, without affecting its functionality in the methods of the present invention. 

20 According to a preferred feature of the present invention, enhanced or Increased expression of 
a nucleic acid encoding a 2xC2H2 zinc finger protein is envisaged. Methods for obtaining 
enhanced or increased expression of genes or gene products are well documented In the art 
and include, for example, over-expression driven by a strong promoter, the use of transcription 
enhancers or translation enhancers. The term over-expression as used herein means any form 

25 of expression that is additional to the original wild-type expression level. Preferably the nucleic 
acid to be introduced into the plant and/or the nucleic acid that is to be overexpressed in the 
plant is in the sense direction with respect to the promoter to which it is operably linked. 
Preferably, the nucleic acid sequence represented by SED ID NO 1 is over-expressed in a 
plant. However, it should be clear that the applicability of the invention is not limited to use of 

30 the nucleic acid represented by SEQ ID NO 1 nor to the nucleic acid sequence encoding the 
amino acid sequence of SEQ ID NO 2, but that other nucleic acid sequences encoding 
homologues, derivatives or active fragments of SED ID NO 1 or SED ID NO 2 may be useful in 
the methods of the present invention. Examples of nucleic acids or proteins are provided in 
SEQ ID NO 10 to SEQ ID NO 50. 

35 

Alternatively and/or additionally, increased expression of a 2xC2H2 encoding gene or 
increased level and/or activity of a 2xC2H2 protein in a plant cell, is achieved by mutagenesis. 
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For example these mutations may be responsible for altered control of the 2xC2H2 gene, 
resulting in more expression of the gene, relative to the wild-type gene. Mutations can also 
cause conformational changes in a protein, resulting in more activity and/or higher levels of the 
2xC2H2 protein. 

5 

Modifying gene expression (whether by a direct or indirect approach) encompasses altered 
transcript levels of a gene. Altered transcript levels may be sufficient to induce certain 
phenotypic effects, for example via the mechanism of cosuppression. Here the overall effect of 
Introduction of a transgene is that there is less activity In the cell of the protein encoded by a 

10 native gene having homology to the introduced transgene. Therefore, according to another 
embodiment of the present invention, there is provided a method for modifying growth 
characteristics in a plant, comprising decreasing expression of a gene encoding a 2xC2H2 zinc 
finger protein or decreasing level and/or activity of a 2xC2H2 zinc finger protein. Examples of 
decreasing expression, level and/or activity of a protein in a cell are well documented in the art 

15 and include, for example, downregulation of expression by anti-sense techniques, RNAi 
techniques, small interference RNAs (siRNAs) and microRNA (miRNA) . 

Another method for downregulation of gene expression or gene silencing comprises use of 
ribozymes, for example as described in Atkins et al. 1994 (WO 94/00012), Lenee et al. 1995 
20 (WO 95/03404), Lutziger et al. 2000 (WO 00/00619), Prinsen et al. 1997 (WO 97/3865) and 
Scott et al. 1997 (WO 97/38116). 

Gene silencing may also be achieved by insertion mutagenesis (for example, T-DNA insertion 
or transposon insertion) or by gene silencing strategies as described by, among others, Angell 
25 and Baulcombe 1998 (WO 98/36083), Lowe et al. 1989 (WO 98/53083), Lederer et al. 1999 
(WO 99/15682) or Wang et al. 1999 (WO 99/53050). 

Expression of an endogenous gene may also be reduced if it contains a mutation. Such a 
mutation or such a mutant gene may be isolated and introduced into the same or different 
30 plant species in order to obtain plants having modified growth characteristics. Examples of 
such mutants are dominant negative mutants of a 2xC2H2 zinc finger gene. 

Genetic constructs aimed at silencing gene expression may comprise the 2xC2H2 zinc finger 
nucleic acid, for example as represented by SEQ ID NO 1 (or one or more portions thereof or a 
35 sequence capable of hybridising therewith), in a sense and/or antisense orientation relative to 
the promoter sequence. The sense or antisense copies of at least part of the endogenous 
gene in the form of direct or inverted repeats may also be utilised in the methods according to 
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the invention. The growth characteristics of plants may also be modified by introducing into a 
plant at least part of an antisense version of the nucleotide sequence represented by SEQ ID 
N01. 

5 According to a further embodiment of the present invention, genetic constructs and vectors to 
facilitate introduction and/or to facilitate expression of the 2xC2H2 zinc finger nucleotide 
sequences useful in the methods according to the invention are provided. Therefore, according 
to the present invention, there is provided a construct comprising: 

(i) a nucleic add capable of modifying expression of a nucleic acid encoding a 
10 2xC2H2 zinc finger protein and/or modifying level and/or activity of a 2xC2H2 zinc finger 

protein; 

(ii) one or more control sequence capable of driving expression of the nucleic acid 
sequence of (i); and optionally 

(Hi) a transcription termination sequence. 

15 

Constructs useful in the methods according to the present invention may be constructed using 
recombinant DNA technology well known to persons skilled in the art The gene constructs 
may be inserted into vectors, which may be commercially available, suitable for transforming 
into plants and suitable for expression of the gene of interest in the transformed cells. 
20 Preferably the genetic construct is a plant expression vector. 

The nucleic acid according to (i) is advantageously any of the nucleic acids described 
hereinbefore. A preferred nucleic acid is the nucleic acid represented by SEQ ID NO 1 or a 
variant thereof as hereinbefore defined, or is a nucleic acid sequence encoding a sequence 
25 represented by SEQ ID NO 2 or a variant as hereinbefore defined. For example such variants 
encode a protein as presented in any of SEQ ID NO 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31. 
33, 35. 37, 39, 42, 44, 46, 48 and 50. 

The terms "regulatory element" and "control sequence" are used herein interchangeably and 
30 are to be taken in a broad context to refer to regulatory nucleic acids capable of effecting 
expression of the sequences to which they are operably linked. Encompassed by the 
aforementioned terms are promoters. A "promoter" encompasses transcriptional regulatory 
sequences derived from a classical eukaryotic genomic gene (including the TATA box which is 
required for accurate transcription initiation, with or without a CCAAT box sequence) and 
35 additional regulatory elements (i.e. upstream activating sequences, enhancers and silencers) 
which alter gene expression in response to developmental and/or external stimuli, or in a 
tissue-specific manner. Also included within the term is a transcriptional regulatory sequence 

20 
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of a classical prokaryotic gene, in which case it may include a -35 box sequence and/or -10 
box transcriptional regulatory sequences. The term "regulatory element" also encompasses a 
synthetic fusion molecule or derivative which confers, activates or enhances expression of a 
nucleic acid molecule in a cell, tissue or organ. The term "operably linked" as used herein 
5 refers to a functional linkage between the promoter sequence and the gene of interest, such 
that the promoter sequence is able to initiate transcription of the gene of interest. Preferably, 
the gene of interest is operably linked to a promoter in a sense direction. 

Advantageously, any type of promoter may be used to drive expression of the nucleic acid 
10 sequence depending on the desired outcome. 

Promoters useful for the present invention are described in EP 03075331.3, which promoters 
and sequences are incorporated herein by reference. 

Other examples of preferred promoters are presented in Table I (a) to (c), which promoters or 
15 derivatives thereof are useful in the methods and/or in making the constructs of the present 
invention. Accordingly, genetic constructs comprising of the nucleic acids of (i), for example a 
2xC2H2 nucleic acid, and at least part of a promoter from Table I (a) to (c) or from EP 
03075331.3, preferably, wherein said parts are operably linked, are also provided by the 
present invention. 

20 

According to onother embodiment, the nucleic acid of (i) is operably linked to a constitutive 
promoter. The term "constitutive" as defined herein refers to a promoter that is expressed 
substantially continuously. Furthermore, preferably the constitutive promoter is a ubiquitous 
promoter, which is expressed in more than one, preferably in most or substantially all tissues 

25 of the plant. Preferably, the constitutive promoter to be used in the methods of the present 
invention, or cloned in the genetic constructs of the present invention, is a plant promoter, 
preferably a constitutive promoter, such as a GOS2 promoter or a promoter with similar 
strength and/or similar expression pattern. Preferably plant promoters derived from a plant 
nucleic acid are used. Alternatively, promoters operable in plant, such as promoters derived 

30 from plant pathogens are used. 

According to another embodiment of the invention, the nucleic acid of (i) is operably linked to a 
plant promoter, preferably a tissue-preferred promoter. The term "tissue-preferred" as used 
herein refers to a promoter that is expressed predominantly in at least one tissue or organ. For 
35 example, the tissue-preferred promoter is a seed-preferred promoter, such as a pWS18 
(Joshee et al. Plant Cell Physiol. 1998 Jan;39(1):64-72. ) or a promoter of similar strength 
and/or similar expression pattern. 
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Promoters with similar strength and/or similar expression pattern may be found by coupling the 
promoter to a reporter gene and checking the function of the reporter gene in different tissues 
of a plant. One suitable reporter gene is beta-glucuronidase and the colorimetric GUS staining 
5 to visualize the beta-glucuronidase activity in a plant tissue is well known to a person skilled in 
the art. 

Table I (a): flower preferred promoters useful in the present invention. Sequences of these 
promoters are described in the cited reference, which sequences are herein incorporated by 
10 reference. 



Gene 


Expression 


Reference 


A1PRP4 


flowers 


http://salus.medium.edu/mmg/tiemey/html 


chalene synthase (chsA) 


flowers 


Van der Meer, atal., Plant Mol. Biol. 15, 95- 
109, 1990. 


LAT52 


anther 


Twell era/ Mol. Gen Genet. 217:240-245 
(1989) 


apetala-3 


flowers 





Table I (b): seed-preferred promoters useful in the present invention. Sequences of these 
15 promoters are described in the cited reference, which sequences are herein Incorporated by 
reference. 



Gene 


Expression 


Reference 


seed-specific genes 


seed 


Simon, ef a/., Plant Mol. Biol. 5: 191, 1985; 
Scofleld, et al., J. Biol. Chem. 262: 12202, 
1987.; Baszczynski, etal.. Plant Mol. Biol. 14: 
633, 1990. 


Brazil Nut albumin 


seed 


Pearson, et al., Plant Mol. Biol. 18: 235-245, 
1992. 


legumin 


seed 


Ellis, et al., Plant Mol. Biol. 10: 203-214, 
1988. 


glutelin (rice) 


seed 


Takalwa, et al., Mol. Gen. Genet. 208: 15-22, 
1986; Takaiwa, et al., FEBS Letts. 221: 43- 
47, 1987. 


zeln 


seed 


Matzke et al Plant Mol Biol, 14(3):323-32 
1990 


napA 


seed 


Stalberg, etal, Planta 199: 515-519, 1996. 
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wheat LMW and HMW glutenin-1 


endosperm 


Mol Gen Genet 216:81-90, 1989; NAR 
17:461-2, 1989 


wheat SPA 


seed 


Albani ©fa/, Plant Cell, 9: 171-184, 1997 


wheat a, B, y-glladlns 


endosperm 




barley Itrt promoter 


endosperm 




barley B1, C, D, hordeln 


endosperm 


Theor Appl Gen 98:1253-62, 1999; Plant J 
4:343-55, 1993; Mol Gen Genet 250:750-60, 
1996 


barley DOF 


endosperm 


Mena ef al, The Plant Journal, 116(1): 53-62, 
1998 


blz2 


endosperm 


EP991 06056.7 




endosperm 


Vlcente-Carbajosa ef a/.. Plant J. 13:629- 
640, 1998. 


rice prolamin NRP33 


endosperm 


Wu ef al, Plant Cell Physiology 39(8) 885- 
889, 1998 


rice a-globulin Glb-1 


endosperm 


Wu ef al, Plant Cell Physiology 39(8) 885- 
889 1998 


rice OSH1 


embryo 


Sato ef al, Proc. Natl. Acad. Scl. USA, 93: 


rice arglobulin REB/OHP-1 


endosperm 


Nakase ef al. Plant Mol. Biol. 33: 513-522, 


rice ADP-glucose PP 


endosperm 


Trans Res 6:157-68, 1997 






Plant J 12:235-46, 1997 


sorgum y-kafirin 


endosperm 


PMB 32:1029-35, 1996 


KNOX 


embryo 


Postma-Haarsma ef al. Plant Mol. Biol. 
39257-71,1999 


rice oleosin 


embryo and 
aleuron 


Wu ef af, J. Biochem., 123:386, 1998 


sunflower oleosin 


seed (embryo 
and dry seed) 


Cummins, ef al., Plant Mol. Biol. 19: 873-876, 
1992 



Table I (c): constitutive promoters useful in the present invention. Sequences of these 
promoters are described in the cited reference, which sequences are herein incorporated by 
reference. 



Gene 


Expression 


Reference 


Actin 


constitutive 


McElroy et al, Plant Cell, 2: 163-171 , 
1990 


CAMV35S 


constitutive 


Odell ef al, Nature, 313: 810-812, 
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1985 


CaMV19S 


constitutive 


Nilsson et al.. Physiol. Plant 
700:456-462, 1997 


GOS2 


constitutive 


de Pater et al, Plant J Nov;2(6):837- 
44, 1992 


ubiquitin 


constitutive 


Christensen et al, Plant Mol. Biol. 
18: 675-689, 1992 


rice cyclophilin 


constitutive 


Buchholz etal, Plant Mol Biol. 25(5): 
837-43, 1994 


maize H3 histone 


constitutive 


Lepetit et al, Mol. Gen. Genet. 
231:276-285, 1992 


actin 2 


constitutive 


An et al, Plant J. 10(1); 107-121. 
1996 



Optionally, one or more terminator sequences may also be used in the construct introduced 
Into a plant. The term "terminator 0 encompasses a control sequence which Is a DNA sequence 
at the end of a transcriptional unit which signals 3' processing and polyadenylation of a primary 
5 transcript and termination of transcription. Additional regulatory elements may include 
transcriptional as well as transfational enhancers. Those skilled in the art will be aware of 
terminator and enhancer sequences which may be suitable for use in the invention. Such 
sequences would be known or may readily be obtained by a person skilled in the art. 

10 The genetic constructs of the invention may further include an origin of replication sequence 
which is required for maintenance and/or replication in a specific cell type. One example is 
when a genetic construct is required to be maintained in a bacterial cell as an episomal genetic 
element (e.g. plasmid or cosmid molecule). Preferred origins of replication include, but are not 
limited to, the f1-ori and colE1 . 

15 

The genetic construct may optionally comprise a selectable marker gene. As used herein, the 
term "selectable marker gene" includes any gene which confers a phenotype on a cell in which 
it is expressed to facilitate the identification and/or selection of cells which are transfected or 
transformed with a genetic construct of the invention. Suitable markers may be selected from 
20 markers that confer antibiotic or herbicide resistance. Cells containing the recombinant DNA 
will thus be able to survive in the presence of antibiotic or herbicide concentrations that kill 
untransformed cells. Examples of selectable marker genes include genes conferring resistance 
to antibiotics (such as nptll encoding neomycin phosphotransferase capable of 
phosphorylating neomycin and kanamycin, or hpt encoding hygromycin phosphotransferase 
24 
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capable of phosphorylating hygromycin), to herbicides (for example bar which provides 
resistance to Basta; aroA or gox providing resistance against glyphosate), or genes that 
provide a metabolic trait (such as manA that allows plants to use mannose as sole carbon 
source). Visual marker genes result in the formation of colour (for example beta-glucuronidase, 
5 GUS), luminescence (such as luciferase) or fluorescence (Green Fluorescent Protein, GFP, 
and derivatives thereof). Further examples of suitable selectable marker genes include the 
ampicillin resistance (Ampr), tetracycline resistance gene (Tcr), bacterial kanamycin resistance 
gene (Kanr), phosphinothricin resistance gene, and the chloramphenicol acetyltransferase 
(CAT) gene, amongst others 

10 

The present invention also encompasses plants obtainable by the methods according to the 
present invention. The present invention therefore provides plants obtainable by the method 
according to the present invention, which plants have modified growth characteristics, which 
plants have altered 2xC2H2 zinc finger protein level and/or activity and/or altered expression of 
15 a nucleic acid sequence encoding a 2xC2H2 zinc finger protein. 

Therefore, according to one aspect of the present invention, there is provided a method for the 
production of plants, having modified growth characteristics, comprising introducing, into a 
plant, a nucleic acid capable of modifying activity of a 2xC2H2 zinc finger protein and/or 
20 capable of modifying expression of a 2xC2H2 zinc-finger gene. According to a further 
embodiment of the present invention, there is provided a method for the production of 
transgenic plants having modified growth characteristics, comprising introduction and 
expression in a plant of a 2xC2H2 nucleic acid. 

25 More specifically, the present invention provides a method for the production of transgenic 
plants having modified growth characteristics, which method comprises: 
(i) introducing into a plant or plant cell a 2xC2H2 zinc finger nucleic acid; 

(Hi) cultivating the plant cell under conditions promoting plant growth. 

30 The growth characteristic may be any of the characteristics defined hereinunder. 

The 2xC2H2 zinc finger nucleic acid includes all variant nucleic acids as described herein 
before and includes all nucleic acids encoding all variant proteins as described herein before. 
Cultivating the plant cell under conditions promoting plant growth, may or may not include 
35 regeneration and or growth to maturity. 
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The protein itself and/or the nucleic acid itself may be introduced directly into a plant cell or into 
the plant itself (including introduction into a tissue, organ or any other part of the plant). 
According to a preferred feature of the present invention, the nucleic acid is preferably 
introduced into a plant by transformation. . 

5 

The term "transformation" as referred to herein encompasses the transfer of an exogenous 
polynucleotide into a host cell, irrespective of the method used for transfer. Plant tissue 
capable of subsequent clonal propagation, whether by organogenesis or embryogenesis, may 
be transformed with a genetic construct of the present Invention and a whole plant regenerated 

10 therefrom. The particular tissue chosen will vary depending on the clonal propagation systems 
available for, and best suited to, the particular species being transformed. Exemplary tissue 
targets include leaf disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus 
tissue, existing meristematic tissue (e.g., apical meristem, axillary buds, and root meristems), 
and induced meristem tissue (e.g., cotyledon meristem and hypocotyl meristem). The 

15 polynucleotide may be transiently or stably introduced into a host cell and may be maintained 
non-integrated, for example, as a plasmid. Alternatively, it may be integrated into the host 
genome. The resulting transformed plant cell can then be used to regenerate a transformed 
plant in a manner known to persons skilled in the art 

20 Transformation of a plant species Is now a fairly routine technique. Advantageously, any of 
several transformation methods may be used to introduce the nucleic acid of interest (e.g. the 
2xC2H2 nucleic acid) into a suitable ancestor cell. Transformation methods include the use of 
liposomes, electroporation, chemicals that increase free DNA uptake, injection of the DNA 
directly into the plant, particle gun bombardment, transformation using viruses or pollen and 

25 microprojection. Methods may be selected from the calcium/polyethylene glycol method for 
protoplasts (Krens, F.A. et al., 1882, Nature 296, 72-74; Negrutiu I. et al., June 1987, Plant 
Mol. Biol. 8, 363-373); electroporation of protoplasts (Shillito R.D. et al., 1985 Bio/Technol 3, 
1099-1102); microinjection into plant material (Crossway A. et al., 1986, Mol. Gen Genet 202, 
179-185); DNA or RNA-coated particle bombardment (Klein T.M. et al., 1987, Nature 327, 70) 

30 infection with (non-integrative) viruses and the like. A preferred transformation method Is an 
Argobacterium mediated transformation method. 

Transgenic rice plants expressing a 2xC2H2 gene are preferably produced via Agmbacterium- 
mediated transformation using any of the well-known methods for rice transformation, such as 
35 the ones described in any of the following: published European patent application EP 1198985 
A1, Aldemita and Hodges (Planta, 199, 612-617, 1996); Chan et al. (Plant Mol. Biol. 22 (3) 
491-506, 1993); Hiei et al. (Plant J. 6 (2) 271-282, 1994); which disclosures are incorporated 
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by reference herein as if fully set forth. In the case of corn transformation, the preferred 
method is as described in either Ishida et al. (Nat Biotechnol. 1996 Jun; 14(6): 745-50) or 
Frame et al. (Plant Physiol. 2002 May; 129(1): 13-22), which disclosures are incorporated by 
reference herein as if fully set forth. 

5 

Generally after transformation, plant cells or cell groupings are selected for the presence of 
one or more markers which are encoded by plant-expressible genes co-transferred with the 
gene of interest, following which the transformed material is regenerated into a whole plant 

10 Following DNA transfer and regeneration, putatively transformed plants may be evaluated, for 
instance using Southern analysis, for the presence of the gene of interest, copy number and/or 
genomic organisation. Alternatively or additionally, expression levels of the newly introduced 
DNA may be undertaken using Northern and/or Western analysis, both techniques being well 
known to persons having ordinary skill in the art. 

15 

The generated transformed plants may be propagated by a variety of means, such as by clonal 
propagation or classical breeding techniques. For example, a first generation (or T1) 
transformed plant may be selfed to give homozygous second generation (or T2) transformants, 
and the T2 plants further propagated through classical breeding techniques. 

20 

The generated transformed organisms may take a variety of forms. For example, they may be 
chimeras of transformed cells and non-transformed cells; clonal transformants (e.g., all cells 
transformed to contain the expression cassette); grafts of transformed and untransformed 
tissues (e.g., in plants, a transformed rootstock grafted to an untransformed scion). 

25 

The present invention clearly extends to any plant cell or plant produced by any of the methods 
described herein, and to all plant parts and propagules thereof. The present Invention extends 
further to encompass the progeny of a primary transformed or transfected cell, tissue, organ or 
whole plant that has been produced by any of the aforementioned methods, the only 

30 requirement being that progeny exhibit the same genotypic and/or phenotypic characteristic(s) 
as those produced in the parent by the methods according to the invention. The invention also 
includes host cells having modified expression and/or level and/or activity of a 2xC2H2 zinc 
finger protein. Such host cells for example comprise genetic constructs as mentioned above. 
Preferred host cells according to the invention are derived from a plant, algae, bacterium, 

35 fungus, yeast, insect or animal. The invention also extends to harvestable parts of a plant such 
as but not limited to seeds, leaves, fruits, flowers, petals, stamen, stem cultures, stem, 
rhizomes, roots, tubers, bulbs or cotton fibers. 
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The term "plant" as used herein encompasses whole plants, ancestors and progeny of the 
plants and plant parts, including seeds, shoots, stems, roots (including tubers), and plant cells, 
tissues and organs. The term "plant" also encompasses suspension cultures, embryos, 
5 meristematic regions, callus tissue, leaves, gametophytes, sporophytes, pollen, and 
microspores. Plants that are particularly useful in the methods of the invention include all 
plants which belong to the superfamily Viridiplantae, in particular monocotyledonous and 
dicotyledonous plants including, fodder or forage legumes, ornamental plants, food crop, tree, 
or shrub selected from the list comprising Acacia spp., Acerspp., Actinldia spp.^esculus spp., 

10 Agathis australis, Albizia amara, Alsophita tricolor, Andropogon spp., Arachls spp, Areca 
catechu, Astelia fragrans, Astragalus cicer, Baikiaea plurijuga, Betula spp., Brasstca spp., 
Bruguiera gymnorrhiza, Burkea africana, Butea frondosa, Cadaba farinosa, Calliandra spp, 
Camellia sinensis, Carina Indica, Capsicum spp., Cassia spp., Centroema pubescens, 
Chaenomeles spp.,Cinnamomum cassia, Coffea arabica, Colophospermum mopane, 

15 Coronillia varia, Cotoneaster serotina, Crataegus spp., Cucumis spp., Cupressus spp., 
Cyathea dealbata, Cydonia oblonga, Cryptomeria japonica, Cymbopogon spp., Cynthea 
dealbata, Cydonia oblonga, Dalbergia monetaria, Davallia divaricata, Desmodium spp., 
Dicksonia squarosa, Diheteropogon amplectens, Dioclea spp, Dolichos spp., Dorycnium 
rectum, Echinochloa pyramidalis, Ehrartia spp., Eleusine coracana, Eragrestis spp., Erythrina 

20 spp., Eucalyptus spp., Euclea schimperi, Eulalia villosa, Fagopyrum spp., Feljoa seliowlana, 
Fragaria spp., Flemingia spp, Freycinetla banksii, Geranium thunbergii, Ginkgo biloba, Glycine 
Javanica, Gliricidla spp, Gossypium hirsutum, Grevillea spp., Guibourtia coleosperma, 
Hedysarum spp., Hemarthia altisslma, Heteropogon contortus, Hordeum vulgare, Hyparrhenia 
rufa, Hypericum erectum, Hyperthelia dissoluta, Indigo incarnata, Iris spp., Leptarrhena 

25 pyrolifolla, Lespediza spp., Lettuca spp., Leucaena leucocephala, Loudetla simplex, Lotonus 
bainesli, Lotus spp., Macrotyloma axillare, Malus spp., Manihot esculenta, Medicago saliva, 
Metasequoia glyptostroboides, Musa saplentum, Nicotianum spp., Onobrychis spp., 
Ornithopus spp., Oryza spp., Peltophorum africanum, Pennisetum spp., Persea gratissima, 
Petunia spp., Phaseolus spp., Phoenix canariensis, Phormium cookianum, PhoVnia spp., 

30 Picea glauca, Pinus spp., Pisum sativum, Podocarpus totara, Pogonarthria Heckii, 
Pogonarthria squamosa, Populus spp., Prosopls cineraria, Pseudotsuga menziesil, 
Pterolobium stellatum, Pyrus communis, Quercus spp., Rhaphiolepsis umbellata, 
Rhopalostylis sapida, Rhus natalensis, Ribes gmssularia, Ribes spp., Robinia pseudoacacia, 
Rosa spp., Rubus spp., Salix spp., Schyzachyrium sanguineum, Scladopitys verticillata, 

35 Sequoia sempervirens, Sequoiadendron giganteum, Sorghum blcolor, Spinacla spp., 
Sporobolus fimbriatus, Stiburus alopecuroides, Stylosanthos humilis, Tadehagi spp, Taxodium 
dlstichum, Themeda triandra, Trifolium spp., TriUcum spp., Tsuga heterophylla, Vacclnlum 
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spp., Via'a spp.Vitis vinifera, Watsonia pyramidata, Zantedeschia aethiopica, Zea mays, 
amaranth, artichoke, asparagus, broccoli, brusse) sprout, cabbage, canola, carrot, cauliflower, 
celery, collard greens, flax, kale, lentil, oilseed rape, okra, onion, potato, rice, soybean, straw, 
sugarbeet, sugar cane, sunflower, tomato, squash, and tea, trees and algae amongst others. 
5 According to a preferred embodiment of the present invention, the plant is a crop plant such 
soybean, sunflower, canola, alfalfa, rapeseed, cotton, tomato, potato or tobacco. According to 
another preferred embodiment of the present invention, the plant is a monocotyledonous plant, 
such as sugar cane, further preferably a cereal, most preferably the plant is selected from the 
group consisting of rice, maize, wheat, barley, millet, rye or oats. 

10 

In a particular embodiment of the present invention, proteins of one plant species (for example 
Arabldopsis) are introduced in another plant species (for example rice). It has been shown in 
the present invention that plant growth characteristics are improved by introduction of a 
2xC2H2 zinc finger gene or protein from a dlcot into a monocot 

15 

According to a particular embodiment of the invention, there are provided methods as 
described above, wherein the plant is a monocot More preferably the plant is rice or corn. 

Advantageously, performance of the methods according to the present invention leads to 
20 plants having modified growth characteristics. 

The term "growth characteristic* as used herein, preferably refers to anyone or more of, but is 
not limited to, yield, architecture and cycle time. 

The term "yield" means the amount of harvested material. For crop plants yield also means the 
25 amount of harvested material per acre of production. Depending on the crop the harvested part 
of the plant may be a different part or tissue of the plant, such as seed (e.g. rice, sorghum or 
com when grown for seed); total above-ground biomass (e.g. for com, when used as silage), 
root (e.g. sugarbeet), fruit (e.g. tomato), cotton fibers, or any other part of the plant which is of 
economic value. "Yield" also encompasses yield stability of the plants, meaning that year after 
30 year, one can obtain the same yield from the progeny of the plants, without too much 
interference of external factors, such as weather conditions. "Yield" also encompasses yield 
potential, which as the maximum obtainable yield. 

Yield maybe dependent on a number of yield components. The parameters for these 
components are well known by a person skilled in the art. For example breeders are well 
35 aware of the specific yield components and the corresponding parameters for the crop they are 
aiming to improve. 
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For example key yield components for corn include number of plants per hectare or acre, 
number of ears per plant, number of rows (of seeds) per ear, number of kernels per row, and 
thousand kernel weight. For silage corn typical parameters are the above ground biomass and 
energy content. 

5 Key yield components for rice include number of plants per hectare or acre, number of 
panicles per plant, number of spikelets per panicle, seed filling rate (number of filfed seeds) 
and thousand kernel weight. Preferentially methods for increasing yield of rice encompass 
increased number of flowers per panicle and an increased number of filled seeds. The 
parameter of increased total number of seeds may be linked to increased number of flowers. 
10 "Yield" further encompasses typical biomass components, such as above ground parts of a 
plant and the root system. General biomass parameters are area and dry weight. Specific 
parameters for above ground biomass further encompass above ground area and plant height. 
Specific parameters for the root system encompass root ratio, root length and penetration 
depth, root branching, root hair density, root pulling resistance and aerenchyma formation. 

15 

The plants of the present invention are characterized by increased number of filled seeds, 
increased total seed weight, increased total number of seeds and increased harvest Index. 
Therefore the methods of the present invention are particularly favorable to be applied in 
cereals such as rice and corn (maize). Accordingly, a particular embodiment of the present 
20 Invention relates to a method to increase yield of com, comprising modifying expression of a 
nucleic acid encoding a 2xC2H2 zinc finger protein. 

The plants of the present invention are characterized by an increase in thousand kernel weight 
and therefore the seed size or seed volume and/or the seed content and/or seed composition 
25 are altered by the methods of the present invention. The seeds provided by the methods of the 
present invention may have more nutritional value, more starch and/or more oil, possibly due 
to their bigger size. 

The plants of the present invention are characterized by more above ground area. Therefore, 
30 the methods of the present invention are particularly favorable for crops grown for their green 
tissue and/or grown for their above ground biomass. The methods of the present invention are 
particularly useful for grasses, forage crops (such as forage com (maize), clover, medicago 
etc.), trees, sugar cane etc. 

35 The improvement in yield as obtained by the methods of the invention, may be obtained as a 
result of improvement of one or more of the above mentioned yield components and/or 
parameters. 
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The term "architecture" as used herein encompasses the appearance or morphology of a 
plant, including any one or more structural features or combination of structural features 
thereof. Such structural features include the shape, size, number, position, texture, 
5 arrangement, and pattern of any cell, tissue or organ or groups of cells, tissues or organs of a 
plant, including the root, leaf, shoot, stem, petiole, trichome, flower, petal, stigma, style, 
stamen, pollen, ovule, seed, embryo, endosperm, seed coat, aleurone, fibre, cambium, wood, 
heartwood, parenchyma, aerenchyma. sieve element, phloem or vascular tissue, amongst 
others. Particular architectural characteristics that may be modified by the methods of the 

10 present invention are increased plant height, increased number or size of stems or stalks or 
tillers or panicles or pedicles, increased number or size of inflorescences, increased branching 
of for example of tassels and ears or altered flowering characteristics. A preferred architectural 
characteristic that may be modified by the methods of the present invention is leaf architecture. 
The term "leaf architecture" as used herein comprises typical leaf characteristics such as 

15 length, width, thickness, cell number, cell size and greenness. 

Typically, the plants of the present invention display increased leaf surface area and /or 
increased leaf blade width. This trait is particularly important as it allows the plant to optimize 
the shape of its leaf to maximize the area used for photosynthesis. For that purpose, preferably 
20 the leaf blade is widened, but alternatively, the leaves are longer or smaller or rounder. These 
effects may lead to more healthy plants. Alternatively, this trait attributes aesthetic properties to 
the plant such as greenness and stronger leafs. 

"cycle time" of the plant as used herein means the time wherein a plant reaches 90% of its 
25 . maximum total area. This parameter is an Indication of the duration of the vegetative growth. 
Prolonged vegetative growth was only displayed in some of the plants according to the present 
invention and may be controlled by choice of the transformation event and/or by choice of the 
promoter driving the 2xC2H2 nucleic acid. For example this characteristic was not displayed 
when a seed-preferred promoter was used. 

30 

Other "growth characteristics" that may be improved by the methods of the present invention 
are growth rate, early vigour, modified Tmid, T90 or A42 or altered growth curve. 

It is clear from the data as presented in the examples that one or more of the growth 
35 characteristics as defined herein above, may be combined in one plant. Alternatively, 
depending on the chosen transformation event and/or depending on the promoter used, one 
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or more of these growth characteristics may be present or absent or more or less pronounced 
in the plant. 

The methods of the present invention may also be used to confer stress tolerance to plants. In 
5 particular, a 2xC2H2 of the STZ type may be used to confer to a plant salt stress tolerance 
and/or drought stress tolerance. According to a specirfic embodiment, a tissue preferred 
promoter, such as a seed-preferred promoter" is used in these methods. 

The present invention also relates to use of a nucleic acid sequence encoding a zinc finger 
10 protein and homologues, derivatives and active fragments thereof in modifying the growth 
characteristics of plants, preferably in increasing yield, further preferably increasing seed yield. 
The present invention also relates to use of a nucleic acid sequence encoding a 2xC2H2 zinc 
finger protein and homologues, derivatives and active fragments thereof and to the 2xC2H2 
zinc finger protein itself and to homologues, derivatives and active fragments thereof as a 
15 growth regulator. The sequences represented by SEQ ID NO 1, and portions thereof and SEQ 
ID NO 2, and homologues, derivatives and active fragments thereof are useful in modifying the 
growth characteristics of plants, as hereinbefore described. The sequences would therefore 
find use as growth regulators, such as herbicides or growth stimulators. The present invention 
also provides a composition comprising a protein represented by SEQ ID NO 2, or a 
20 homologue, derivative or active fragment thereof for the use as a growth regulator. A growth 
regulator is used herein as meaning a regulator that increased yield and is therefore also 
referred to as yield regulator. 

In particular, the present invention provides a yield regulating composition comprising a nucleic 
acid encoding a 2xC2H2 protein, and/ or comprising a 2xC2H2 protein, and/or comprising a 
25 construct as defined herein above. Such a yield regulating composition further comprises 
additives normally use In yield regulating compositions, such as a solvent or carrier. 

Conversely, the sequences according to the present invention may also be interesting targets 
for agrochemical compounds, such as herbicides or growth stimulators. Accordingly, the 
30 present invention encompasses use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 
protein and/or of a construct as defined in any of claims 20 to 22 as target for an agrochemical , 
such as a herbicide or a growth stimulator. 

The methods according to the present invention may also be practised by co-expression of a 
gene encoding a 2xC2H2 zinc finger protein in a plant with at least one other gene that 
35 cooperates with the gene encoding a 2xC2H2 zinc finger protein. Such a gene may be a gene 
encoding a target protein of the 2xC2H2 zinc finger protein. Co-expression may be effected by 
cloning the genes under the control of a plant expressible promoter in a plant expressible 
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vector and introducing the expression vector(s) into a plant cell using >Ag/obacfer/um-mediated 
plant transformation. Therefore, the methods according to the present invention may result in 
plants having modified growth characteristics, particularly increased yield, as described 
hereinbefore in combination with other economically advantageous traits, such as further yield- 
5 enhancing traits, tolerance to various stresses, traits modifying various architectural features 
and/or biochemical and/or physiological features. 

Since the plants of the present invention have excellent growth characteristics and have high 
yield, they are suitable for the production of enzymes, pharmaceuticals or agrochemicals. Also, 
10 there are suitable to produce food or feed products. 

The invention clearly extends to enzymes, pharmaceuticals or agrochemicals as well as food 
or feed products Isolated from these plants. 

Further a nucleic acid encoding a 2xC2H2 protein, a 2xC2H2 protein and/or the constructs of 
the present invention may be used breeding programs aiming at the development of plants 
15 with increased yield. 

Particularly, the use of allelic variants as defined above in particular conventional breeding 
programmes, such as in marker-assisted breeding is also encompassed by the present 
invention; this may be in addition to their use in the methods according to the present 

20 invention. Such breeding programmes sometimes require the introduction of allelic variations 
in the plants by mutagenic treatment of a plant. One suitable mutagenic method Is EMS 
mutagenesis. Identification of allelic variants then takes place by, for example, PCR. This is 
followed by a selection step for selection of superior allelic variants of the sequence in question 
and which give rise to altered growth characteristics in a plant. Selection is typically carried out 

25 by monitoring growth performance of plants containing different allelic variants of the sequence 
in question, for example, SEQ ID NO 1. Monitoring growth performance may be done in a 
greenhouse or in the field. Further optional steps include crossing plants in which the superior 
allelic variant was identified with another plant. This could be used, for example, to make a 
combination of interesting phenotypic features 

30 

According to another type of breeding programme a DNA marker is identified which may be 
genetically linked to a gene capable of modifying expression of a nucleic acid encoding a 
2xC2H2 zinc finger protein in a plant, which gene may be a gene encoding the 2xC2H2 zinc 
finger protein itself or any other gene which may directly or indirectly influence expression of 
35 the gene encoding a 2xC2H2 zinc finger protein and/or activity of the 2xC2H2 zinc finger 
protein itself. This DNA marker may then be used in breeding programs to select plants having 
altered growth characteristics. 
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The methods according to the present invention may also be practised by introducing into a 
plant at least a part of a (natural or artificial) chromosome (such as a Bacterial Artificial 
Chromosome (BAC)), which chromosome contains at least a gene encoding a 2xC2H2 zinc 
5 finger protein, optionally together with one or more related gene family members. Therefore, 
according to a further aspect of the present invention, there is provided a method for modifying 
growth characteristics of plants by expressing in a plant at least a part of a chromosome 
comprising at least a gene encoding a 2xC2H2 zinc finger protein. 

10 The present invention will now be described with reference to the following figures in which: 

Fig. 1 is a map of an expression vector for the expression in plants of a 2xC2H2zinc finger 
protein under the control of a GOS2 promoter. CDS1536 is the internal code for the 
Arabldopsis thallana salt tolerant zinc finger (STZ) protein cDNA. The zinc finger protein 

15 expression cassette has a GOS2 promoter and a double terminator sequence (T-zein and T- 
rbcS-deltaGA) located within the left border (LB repeat) and the right border (RB repeat) of the 
Ti plasmid. Cloned within these T-borders are also a screenable marker and a selectable 
marker, each under the control of a constitutive promoter (Prom), foil owed by a terminator 
sequence (poly a and t-NOS). Furthermore, this vector also contains an origin of replication 

20 (pBR322 (ori + bom)) for bacterial replication and a selectable marker (Sm/SpR) for bacterial 
selection. 

Fig. 2A shows digital images from a T1 positive line transformed with an STZ zinc finger 
transgene under control of a GOS2 promoter and Fig. 2B shows digital images of 
25 corresponding nullizygotes plants. 

Fig. 3 lists sequences useful in the methods of the present invention. SEQ ID NO 1 is an STZ 
encoding nucleic acid Isolated from Ambidopsis thaliana; the start and the stop codon are 
highlighted in bold. SEQ ID NO 2 is the STZ protein sequence encoded by SEQ ID NO 1. In 

30 the STZ protein the nuclear localization signal also called the KRS motif or B-box is annotated 
(bold, italics, underlined), as well as the L-box (bold, underlined), the EAR motif (bold, italics), 
and the two C2H2 zinc finger domains with QALGGH motif (bold and boxed). SEQ ID NO 10 
to SEQ ID NO 25 provides the sequences of various orthologs of the Arabidopsls thaliana STZ 
protein from other plant species. SEQ ID NO 26 to SEQ ID NO 35 provides the sequences of 

35 various paralogs (from Arabidopsis) of the STZ protein. SEQ ID NO 36 to SEQ ID NO 50 
provides the sequences of related 2xC2H2 genes and proteins useful in the methods of the 
present invention. 
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Fig. 4 is a photograph of T3 plants grown in a greenhouse (A) or in a field (B). The photograph 
shows yield Increase ( especially in aboveground biomass and plant height) in subsequent 
generations of STZ transformed plants. 

5 

Fig. 5 shows the binary vector for expression in Oryza saliva of the Arabidopsis thaliana STZ 
gene (CDS1536) under the control of a seed preferred WSI18 promoter (PRO0151). This 
vector contains a T-DNA derived from the Ti Plasmid, limited by a left border (LB repeat, LB Ti 
C58) and a right border (RB repeat, RB Ti C58)). 

10 The zinc finger protein expression cassette has a WSI18 (PRO0151 ) promoter and a double 
terminator sequence (T-zein and T-rbcS-deltaGA) located within the left border (LB repeat) 
and the right border (RB repeat) of the Ti plasmid. Cloned within these T-borders are also a 
screenable marker and a selectable marker, each under the control of a constitutive promoter 
(Prom), followed by a terminator sequence (poly a and t-NOS). Furthermore, this vector also 

15 contains an origin of replication (pBR322 (ori + bom)) for bacterial replication and a selectable 
marker (Sm/SpR) for bacterial selection. 

Examples 

The present invention will now be described with reference to the following examples, which 
20 are by way of illustration alone. 

DNA Manipulation 

Unless otherwise stated, recombinant DNA techniques are performed according to standard 
protocols described in Sambrook (2001) Molecular Cloning: a laboratory manual, 3rd Edition 
25 Cold Spring Harbor Laboratory Press, CSH, New York or in Volumes 1 and 2 of Ausubel ef a/. 
(1988), Current Protocols in Molecular Biology, Current Protocols. Standard materials and 
methods for plant molecular work are described in Plant Molecular Biology Labfase (1993) by 
R.D.D. Croy, published by BIOS Scientific Publications Ltd (UK) and Blackwell Scientific 
Publications (UK). 

30 

Example 1: Gene Cloning 

A gene encoding an STZ protein was amplified by PCR from an Arabidopsis thaliana seedling 
cDNA library (Invitrogen, Paisley, UK). After reverse transcription of RNA extracted from 
seedlings, the cDNAs were cloned into pCMV Sport 6.0. Average insert size of the bank was 
35 1 .5 kb, and original number of clones was of 1 .59x1 0 7 cfu. Original titer was determined to be 
9.6x10 s cfu/ml, after first amplification of 6x10 11 cfu/ml. After plasmid extraction, 200 ng of 
template was used in a 50ul PCR mix. Sequences of the primers used for PCR amplification 
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were, including the attB sites for Gateway recombination (in bold) were PRM3204 (sense, start 
codon in italics) 5' GGGGACAAGTTTGTACAAAAAAGCAGGCTTCACA4 7GGCG 
CTCGAGGCTC 3* (SEQ ID NO 3) and PRM3205 (reverse, complementary stop codon in 
italics) 5' GGGGACCACTTTGTACAAGAAAGCTGGGTAATTTCC7T4AAGTTGAAGTTTGA 
5 3' (SEQ ID NO 4). 

PCR was performed using Hifi Taq DNA polymerase in standard conditions. The PCR 
fragment (CDS1536) was amplified and purified using standard methods. The first step of the 
Gateway procedure, the BP reaction, was then performed, during which the PCR fragment 
10 was recombined in vivo with the pDONR pfasmid to produce, according to Gateway 
terminology, an "entry done", p3359. PDONR was purchased from Invitrogen, as part of the 
Gateway technology. 

Example 2: Vector construction for rice transformation with pGOS2::ArSTZ 
15 The entry clone p3359 was subsequently used in an LR reaction with p0640, a destination 
vector used for rice transformation. This vector contains as functional elements within the T- 
DNA borders a plant selectable marker and a Gateway cassette intended for LR in vivo 
recombination with the sequence of interest already cloned in the donor vector. Upstream of 
this Gateway cassette lies the rice GOS2 promoter for constitutive expression of the zinc finger 
20 gene (De Pater ef a/., Plant J. 2 (6) 837-844, 1992). After the recombination step, the resulting 
expression vector with the expression cassette CD4398 (Figure 1) was transformed into 
Agrobacterium strain LBA4404 and subsequently into plants. Transformed rice plants were 
allowed to grow and then examined for various parameters as described in Example 3. 

25 Example 3: Evaluation of TO, T1 and T2 transgenic rice plants transformed with 
pGOS2::AfSTZ (CD4398) 

Approximately 15 to 20 independent TO transformants were generated. The primary TO 
transformants were transferred from tissue culture chambers to a greenhouse for growing and 
harvest of T1 seed. Six events of which the T1 progeny segregated 3:1 for presence/absence 

30 of the transgene were retained. For each of these events, approximately 10 T1 seedlings 
containing the transgene (hetero- and homo-zygotes), and approximately 10 T1 seedlings 
lacking the transgene (nullizygotes), were selected by PCR. Based on the results of the T1 
evaluation three events were chosen, for further characterisation in the T2 generation, one 
event being very positive for a number of parameters, a second event being positive for a 

35 number of parameters, but less pronounced, and a third event being neutral. Seed batches 
from the positive plants (both hetero- and homozygotes) in T1, were screened by monitoring 
marker expression. For each chosen event, the heterozygote seed batches were then selected 
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for T2 evaluation. An equal number of positives and negatives within each seed batch were 
transplanted for evaluation in the greenhouse (i.e., for each event 40 plants were grown of 
which there were about 20 positives for the transgene and about 20 negative). Therefore, the 
total number for the three events amounted to 120 plants for evaluation in the T2 generation. 

5 

T1 and T2 plants were transferred to the greenhouse and evaluated for vegetative growth 
parameters and seed parameters, as described hereunder. 

ffl Statistical analysis of phanotvplc characteristics 

10 A two factor ANOVA (analyses of variance) was used as statistical model for the overall 
evaluation of plant phenotypic characteristics. An F-test was carried out on all the parameters 
measured, for all the plants of all the events transformed with the gene of interest The F-test 
was carried out to check for an effect of the gene over all the transformation events and to 
verify an overall effect of the gene or "global gene effect". Significant data, as determined by 

15 the value of the f-test, indicates a "gene" effect, meaning that the phenotype observed is 
caused by more than the presence or position of the gene. In case of the F-test, the threshold 
for significance for a global gene effect is set at 5% probability level. 

To check for an effect of the genes within an event, i.e., for a line-specific effect, a t-test was 
20 performed within each event using data sets from the transgenic plants and the corresponding 
null plants. "Null plants" or "Null segregants'are the plants treated in the same way as the 
transgenic plant, but from which the transgene has segregated. Null plants can also be 
described as homozygous negative transformant plants. The threshold for significance for the 
t-test is set at 10% probability level. Within one population of transformation events, some 
25 events may be under or above this t-test threshold. This is based on the hypothesis that a 
gene might only have an effect in certain positions in the genome, and that the occurrence of 
this position-dependent effect is not uncommon. This kind of gene effect may also be referred 
to as a "line effect of a gene". The p- value is obtained by comparing the t-value to the t- 
distribution or alternatively, by comparing the F-value to the f-distribution. The p- value stand 
30 for the probability that the null hypothesis (null hypothesis being "there is no effect of the 
transgene") is correct 

(II) Vegetative growth measurements 

The selected plants were grown in a greenhouse. Each plant received a unique barcode label 
35 to link unambiguously the phenotyping data to the corresponding plant. The selected plants 
were grown on soil in 10 cm diameter pots under the following environmental settings: 
photoperiod= 11.5 h, daylight intensity= 30,000 lux or more, daytime temperature= 28°C or 
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higher, night time temperature= 22°C, relative humidity= 60-70%. Transgenic plants and the 
corresponding nullizygotes were grown side-by-side at random positions. From the stage of 
sowing until the stage of maturity (which is the stage were there is no more increase in 
biomass) the plants were passed weekly through a digital imaging cabinet (examples of 
5 pictures are shown in Figures 2A and 2B). At each time point digital images (2048x1 536 pixels, 
16 million colours) were taken of each plant from at least 6 different angles. The parameters 
described below were derived in an automated way from the digital images using image 
analysis software. 

1 0 (a) Aboveground area 

Plant above ground area was determined by counting the total number of pixels from 
aboveground plant parts discriminated from the background. This value was averaged for the 
pictures taken on the same time point from the different angles and was converted to a 
physical surface value expressed in square mm by calibration. Experiments show that the 

15 aboveground plant area measured this way correlates with the biomass of plant parts above 
ground. 

Results of the maximum above ground area values of the lines selected for T2 evaluation are 
summarized in Table 1. The plants of the best performing line showed an increase in biomass 
20 of 34 % , compared to the nullizygotes. 

When an F-test was carried out on all the plants of all the T2 events it became clear that the 
transgenic plants show a significant increase in above ground area, in average an increase of 
approximately 18%. A significant increase in above ground biomass is also displayed by STZ 
transformed plants grown under field conditions (see figure 4). 

25 

Table 1: Aboveground area of STZ transgenic T2 plants. Each row corresponds to one event, 
for which the average maximum aboveground area (expressed in mm") was determined for the 
transgenics (TR) and the null plants (null). The difference in absolute values between the 
transgenic population and the nullizygotes of each event are presented (dif.) as well as the 
30 percentage of difference between the two populations (% dif). P stands for the probability 
produced by the t-test for each event. The last row presents the average numbers calculated 
from all the events. Here the p-value is produced by the F-test 
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Total above ground Area Max (mm*) 


Une 


TR 


null 


dif 


% dif 


p-value 


CD4396 LI 


63947 


4760S 


16341 


34 


0.0021 


CD4396 L2 


42509 


41342 


1167 


3 


0.8063 


CD4396 L3 


41116 


33687 


7429 


22 


0.1107 


Overall 


49178 


41657 


7522 


18 


0.0047 



(b) Plant height measurements 

Plant height was determined by the distance between the horizontal lines going through the 
5 upper pot edge and the uppermost pixel corresponding to a plant part above ground. This 
value was averaged for the pictures taken on the same time point from the different angles and 
was converted, by calibration, to a physical distance expressed in mm. Experiments showed 
that plant height measured this way correlate with plant height measured manually with a ruler. 

10 The increase In plant height was displayed very clearly in STZ transformed plants when 
measured at the end of the vegetative growth (see figure 4A). Also, this parameter, was 
displayed by STZ transformed plants when grown in the field (see figure 4B) at the time of 
harvest 

1 5 (c) Total area cycle time measurements 

Plants were imaged weekly along the complete cell cycle and the maximum total area of the 
plants was determined as mentioned above. Total Area Cycle Time is the time when a plant 
reaches 90% of its maximum total area. This parameter is an indication of the duration of the 
vegetative growth. 

20 Only In some transgenic lines there was an effect on cycle time. These few lines showed a 
prolonged vegetative growth. 

(Ill) Measurement of seed-related parameters 

The mature primary panicles were harvested, bagged, barcode-labelled and then dried for 
25 three days in the oven at 37°C. The panicles were then threshed and all the seeds collected. 
The filled husks were separated from the empty ones using an air-blowing device. After 
separation, both seed lots were then counted using a commercially available counting 
machine. The empty husks were discarded. The filled husks were weighed on an analytical 
balance and the cross-sectional area of the seeds was measured using digital imaging. This 
30 procedure resulted in the set of seed-related parameters described below. 
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(a) Total number of filled seeds per plant 

The number of filled seeds was determined by counting the number of filled husks that 
remained after the separation step. 

S Total numbers of filled seeds per plant are summarized In Table 2. The t-test shows that 
for two events, transgenic plants produce 106% and 130% more filled seeds than the 
nullizygotes. 

Table 2: Number of filled seeds of STZ transgenic T2 plants. Each row corresponds to one 
10 event, for which the average number of filled seeds was determined for the transgenics (TR) 
and the null plants (null). The difference in absolute values between the transgenic population 
and the nullizygotes of each event are presented (dif.) as well as the percentage of difference 
between the two populations (% dif). P stands for the probability produced by the t-test for 
each event. The last row presents the average numbers calculated from all the events. Here 
15 the p-value is produced by the F-test 



Number of filled seeds 


Une 


TR 


null 


dif 


%dif 


p-value 


CD4396 LI 


387.9 


188.7 


199.19 


106 


<0.0001 


CD4396 L2 


163.8 


156.5 


7.22 


5 


0.8382 


CD4396 13 


236.9 


102.9 


133.98 


130 


0.0004 


Overall 


264.9 


159.7 


105.25 


66 


<0.0001 



(b) Total seed weight per plant 

The total seed weight was measured by weighing all filled husks harvested from a plant. 

20 The total seed weight values of STZ transformed plants are summarized in Table 3. STZ 
transgenic plants produce significantly more seed weight than the corresponding nullizygotes. 
The difference in seed weight of the transgenics may be as high as 138% or higher. 

Table 3: Total seed weight per plant of STZ transgenic T2 plants. Each row corresponds to 
25 one event, for which the average total seed weigh (in gram) was determined for the 
transgenics (TR) and the null plants (null). The difference In absolute values between the 
transgenic population and the nullizygotes of each event are presented (dif.) as well as the 
percentage of difference between the two populations (% dif). P stands for the probability 
produced by the t-test for each event. The last row presents the average numbers calculated 
30 from all the events. Here the p-value is produced by the F-test. 
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Total weight of seeds 


Line 


TR 


null 


dif 


%dff 




CD4396 LI 


9.8 


4.5 


5.25 


116 


<0.0001 


CD4396 L2 


3.4 


3.3 


0.1 


3 


0.908 


CD4396 L3 


6.1 


2.6 


3.56 


138 


0.0001 


Overall 


6.5 


3.7 


2.75 


74 


<0.0001 



(c) Harvest Index 

The harvest index in the present invention is defined as the ratio between the total seed yield 
and the above ground area (mm 2 ), multiplied by a factor 10 6 . 

5 

The harvest index values of the STZ-transgenic plants are summarized in Table 4. STZ 
transgenic plants have a significant increase in harvest index. The increase in harvest index of 
the transgenic plants may be as high as 66%, when compared to the corresponding 
nullizygotes. 

10 

Table 4: Harvest index of STZ transgenic T2 plants. Each row corresponds to one event, for 
which the average harvest index was determined for the transgenics (TR) and the null plants 
(null). The difference in absolute values between the transgenic population and the 
nullizygotes of each event are presented (dif.) as well as the percentage of difference between 
15 the two populations (% dif). P stands for the probability produced by the t-test for each event. 
The last row presents the average numbers calculated from all the events. Here the p-value Is 
produced by the F-test. 



Harvest Index 


Line 


TR 


null 


dif 


% dif 




CD4396 LI 


149.1 


90 


59.11 


66 


<0.0001 


CD4396 L2 


74 


73.4 


0.55 


1 


0.9574 


CD4396 13 


121.3 


75.9 


45.32 


60 


<0.0001 




114.8 


82.6 


32.16 


39 


<0.0001 



(d) Thousand kernel welgth (TKW) of plants 
20 Thousand Kernel Weight (TKW) is a parameter extrapolated from the number of filled seeds 
counted, and their total weight. 

The weight values of thousand kernels of STZ transgenic plants are presented in Table 5. STZ 
transgenic plants have increased thousand kernel weight. The increase of TKW of transgenic 
plants may be as high as 6% when compared to the corresponding nullizygotes. 
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Table 5: Thousand kernel weight ofSTZ transgenic T2 plants. Each row corresponds to one 
event, for which the average TKW was determined for the transgenics (TR) and the null plants 
(null). The difference in absolute values between the transgenic population and the 
nullizygotes of each event am presented (dif.) as well as the percentage of difference between 
the two populations (% dif). P stands for the probability produced by the t-test for each event. 
The last row presents the average numbers calculated from all the events. Here the p-value is 
produced by the F-test 



p-value 



(e) Total number of seeds 

The total number of seeds per plant was measured by counting the number of husks harvested 
from a plant. 



The total numbers of seeds per plant are summarized In Table 6. STZ transformed plants have 
15 an increase in total number of seeds. The increase of total number of seeds maybe as high as 
68%, when compared to the corresponding nullizygotes. 



Table 6: Total number of seeds of STZ transgenic T2 plants. Each row corresponds to one 
event, for which the average total number of seeds was determined for the transgenics (TR) 
20 and the null plants (null). The difference in absolute values between the transgenic population 
and the nullizygotes of each event are presented (dif.) as well as the percentage of difference 
between the two populations (% dif). P stands for the probability produced by the t-test for 
each event The last row presents the average numbers calculated from all the events. Here 
the p-value is produced by the F-test. 



Total number of seeds 


Une 


TR 


null 


dif 


% dif 


p-value 


CD4396 LI 


483.5 


367.4 


116.03 


32 


0.0146 


CD4396 L2 


353.9 


327.5 


26.42 


8 


0.5473 


CD4396 L3 


383.6 


228.2 


155.48 


68 


0.0009 


Overall 


406 


312.5 


93.52 


30 


0.0002 
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Conclusion 

It may be concluded that vegetative growth Is increased in the STZ transgenic plants when 
compared to the control non -transgenic plants, as reflected by parameters such as above 
5 ground area, where the increase is above 20 %. This effect may be attributed to the 
expression of the STZ gene in the transgenic plants. Additionally, In some transformation 
events, the length of the vegetative growth is altered in the STZ transgenic plants. For those 
transformation events in which this effect occurs, in average the vegetative growth was 
prolonged with about 4 to 6 days, under the conditions tested. 

10 Furthermore, yield was increased in STZ transgenic plants. Several seed parameters reflect 
this yield increase. The total number of seeds harvested was at least 100% higher in the 
transgenics than in the control plants, for those events showing a differential. For these 
events, there was also an increase in the total number of seeds of the transgenics, which 
increase was higher than 30 %. Seed filling in those transgenics was greatly improved, 

15 reaching differences above 1 00% in the number of filled seeds. 

Seed of the transgenic plants were also heavier, and probably bigger, as suggested by the 
higher values obtained for the thousand kernel weigh. The TKW parameter is a very stable 
parameter in rice cultivara, such as nipponbare, and in the growth conditions here used. This 
means that this parameter is not easily influenced and makes it an important yield parameter. 

20 Therefore a TKW Increase of 6 % represents a significantly increase In yield. 

Harvest index, another important yield parameter, was increased in the transgenic plants with 
more than 50 %. 

In summary, based on the evaluation of STZ transgenic plants in the T1, T2 and further 
generations, it may be concluded that the presence of an STZ transgene, has a positive effect 
25 on the size of the plant and/or its organs, as well as a positive effect on the final yield 
harvested. 

(Ill) Root growth measurements 

Transgenic plants are grown next to their corresponding non-transgenic null segregant in 
transparent pots. In average, for each construct comprising a particular promoter-2xC2H2 
30 combination, a minimum of 5 independent transformation events are evaluated for root growth, 
root development and root architecture. Typically, per transformation event, 10 transgenics are 
compared to 10 nullizygotes. Root pictures are taken weekly during plant growth. The pictures 
are processed and analyzed to extract the values for the root parametes as detailed below. 
Statistical analysis as described above are applied to these data. 

35 
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a) Root Area 

Total root area is calculated from the summed number of pixels of each root images. A positive 
linear correlation between root area and dry weight and root biomass of the root has been 
previously established by similar experiments. Therefore, root area Is a good approximation for 
5 root biomass. 

b) Root Length 

The total perimeter of the roots of a plant is calculated as the sum of the perimeter of all roots 
in the images. A linear correlation between this measurement and root length has been 
10 previously established. Thus, root length is extrapolated from the total root perimeter. 

c) Root Width 

Average root width of a plant is expressed as the ratio between the Root Area and the Root 
Length. 

15 

STZ transgenic plants of the invention show a superior performance when compared to control 
pants. Transgenic plants are altered in one or more the root parameters detailed above. In 
particular the transgenic have increased root biomass, for example due to increased root dry 
weight or area, and/or increased root length and/or increased root width. 

20 

Example 4: Leaf Blade Width Measurement 

Leaves of STZ transgenic plants appeared bigger and wider when compared to the 
corresponding control non-transgenic plants. To quantify the increase In leaf width, leaf blade 
width (length of transversal axe) of the flag leaf was measured with a ruler at the widest point 
25 of the leaf, which is approximately at half of the length, In plants that have reached the end of 
the vegetative growth phase. The results shown in the Table 7, indicate that the increase in 
the leaf blade width in at least the event here measured was around 15 % when compared to 
the corresponding nullizygote. 

30 Table 7: Leaf blade width of STZ transgenic T2 plants. The average leaf blade width was 
determined for the transgenics (TR) and the null plants (null) of the selected event. The 
difference in absolute values between the transgenic population and the nullizygotes of the 
event is presented (d'rf.) as well as the percentage of difference between the two populations 
(% d'rf). P stands for the probability produced by the t-test . 
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Example 5: Vector construction for rice transformation with pWSI18::AtSTZ 

Vector construction for transformation with the pWSI18 (PRO0151) - AtSTZ (CDS1536) 
cassette was carried out essentially as in example 2. The entry clone p3359, described earlier, 
was subsequently used in an LR reaction with p05653, a destination vector used for rice 
transformation. This destination vector contains as functional elements within the T-DNA 
borders a plant selectable marker and a Gateway cassette intended for LR in vivo 
recombination with the sequence of interest already cloned in the donor vector. A WSI18 
promoter for seed preferred expression (PRO0151) is located upstream of this Gateway 
cassette. After the recombination step, the resulting expression vector with the expression 
cassette CD4398 (Figure 5) was transformed into Agrobacterium strain LBA4404 and 
subsequently this vector was transformed to Oryza sativa plants. Transformed rice plants were 
allowed to grow and then examined for various parameters as described in example 3. 



Example 6: Evaluation of TO and T1 transgenic rice plants transformed with the seed 
preferred expression cassette pWS1 1 8: : AtSTZ (CD4398) 

Preparations of calli and of the Agrobacterium tumefaciens strain containing the expression 
vector with the CD4398 expression cassette, were carried out as described in example 3, as 
were the calli transformation and plant regeneration. 



Approximately 15 to 20 independent TO rice transformants were generated. The primary 
transformants were transferred from tissue culture chambers to a greenhouse for growing and 
harvest of T1 seed. Events, of which the T1 progeny segregated 3:1 for presence/absence of 
25 the transgene, were retained. For each of these events, approximately 10 T1 seedlings 
containing the transgene (hetero- and homo-zygotes), and approximately 10 T1 seedlings 
lacking the transgene (nullizygotes), were selected by monitoring marker expression. 
Transgenic plantlets were grown next to control nullyzygotes, seeds were harvested and 
thousand kernel weight determined as previously described. 

30 

Transformed plants comprising the expression cassette CD8490 (seed preferred 
pWSI18::STZ), had a normal and healthy appearance and were harvested at the same time as 
the control plants. The seeds harvested from the transgenic plants had an Increase in 
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thousand kernel weight when compared to the control plants. As shown in Table 8 increase in 
thousand kernel weight was above 10%. 

Table 8: Thousand kernel weight ofSTZ transgenic T1 plants. The average I thousand kernel 
5 weight was determined for the transgenics (TR) and the null plants (null) of the selected event. 
The difference in absolute values between the transgenic population and the nullizygotes of 
the event is presented (dif.) as well as the percentage of difference between the two 
populations (% dif). P stands for the probability produced by the t-test . 



Example 7: Cloning, transformation and evaluation of other 2xC2H2 encoding genes. 

In Table 9 an overview is given of constructs with STZ or other 2xC2H2 zinc finger proteins, 
under control of various promoters, which constructs are made for use in the methods of the 
present invention. The coding regions of the 2xC2H2 genes to be cloned (GOI, Gene of 
Interest) are amplified by PCR from cDNA , following the protocol as in Example 1. Specific 
primers for each 2xC2H2 gene were designed at the start and stop codons of the gene 
sequence as present in the public database under the accession number as indicated in Table 
9. These cloned sequences are also herein incorporated under the SEQ ID NO number as 
mentioned in the table. Moreover, the isolated PCR fragments were also given a unique CDS 
number. 

The PCR fragment with a 2xC2H2 gene is then cloned under the control of a particular 
promoter. Different combinations for different genes are made (see Table 9). Chimeric 
constructs are made and CD numbers represent bacterial strains carrying the chimeric 
construct Corresponding transgenic plants are obtained by transforming the plants with the 
chimeric constructs, following the protocols as mentioned herein before. Evaluation of the 
transgenic events reveals an increase in yield, and increase in leaf surface area and/or an 
increase in duration of vegetative growth in the transgenic plants when compared to the control 
non-transgenic plants. 
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Table 9: examples of 2xC2H2 chimeric constructs useful for the methods of the present invention. *see Tj 



cos 


Accession number 
(cDNA on which primers 
were designed to amplify 
the CDS region) 


Prot ACC number 


SEQ ID NO 


PRO0129* 


PRO017Q* 


PRO0Q81_2* 


PRO0123* 


PRO0207* 


PRO011I 


CDS1536 
STZ 

Arabldopsls 


X95573 


CAA64820 


1+2 


CD4398 


CD11371 


CD11382 


CD10960 


CD10B59 


CD1031; 


CDS2200 
Arabldopsls 


AF022658NM_120516 


AAH8D922.1At5g 04340 


28 + 29 


CD11576 






CD11413 




CD1154I 


CDS2205 
Arabldopsls 


NM_1 23683 


At5g43170 




CD11325 






CD11414 




CD1138- 


CDS2776 
Ortholog 


AF332876 


AAK01713.1 


38 + 37 


CD09948 










CD1031! 


CDS1677 
Homolog 
Arabldopsls 


AL132966 REGION: 
116202..1 16729 


CAB67667 


38 + 39 


CD08462 






CD 




CD 


CDS3337 
Homol09 
Suqarcane 


CA279020 




40 


CD 






CD 




CD 


CDS2416 
Homolog 
Arabldopsls 


AF254447 


At3g57670 


41+42 


CD 






CD 




CD 


CDS2377 
Homolog 
Arabldopsls 


AJ311810 


CAC86167 


43 + 44 


CD 






CD 




CD 


CDS 

Homolog 

Arabldopsls 


AL355775 REGION: 
compIement<7957..8451 ) 


CA890935 


46 + 46 


CD 






CD 




CD 


CDS 

Homolog 

Arabldopsls 


AL391 143 REGION: 
comptement(31730..3293B) 


CAC01747 


47 + 48 


CD 






CD 




CD 


CDS3641 
Homolog 
Arabldopsls 


X98678 


CAA67236 


49 + 50 


CD 






CD 




CD 
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Table 10: examples promoters used in combination with 2xC2H2 for the methods of the 
present invention. 



Promoter 


Preferred expression type 


Origin species 


Gene 


PRO0151 


Seeds (mainly embryo and 
aleurone). Strong expression. 


Oryza sativa 


WSI18 


PRO0110 


Root 


Oryza sativa 


RCc3 


PRO0207 


Green tissue. Moderate expresssion 
evels 


Ssccfrarum officinsrum 


Prp 


PRO0123 


Green tissue. Strong expression 
evels. 


Oryza sativa 


Protochlorophyllide 
reductase 


PRO0090 


Seed specific (mainly endosperm) 


Oryza sativa 1 


Prolamin RP6 


PRO0170 


Constitutive. Strong Expression. 


Oryza sativa 


High Mobility 
Group protein 


PRO0218 


Seeds (mainly embryo and 
aleurone) 


Oryza sativa 


oleosine 18kda 


PRO0061_2 


Young expanding tissues 


Oryza sativa 


beta-expansine 
EXPB9 


PRO0129 


Constitutive. High expression levels. 


Oryza sativa 


GOS2 



5 

Example 8: use of the Invention in corn. 

The methods of the invention described herein are also used in maize. To this aim, an STZ 
encoding gene, for example a maize or other STZ ortholog, is cloned under control of a 
promoter operable in maize, in a plant transformation vector suitable for Agrobacterium- 
10 mediated corn transformation. Methods to use for com transformation have been described in 
literature (Ishida et al., Nat Biotechnol. 1996 Jun;14(6):745-50; Frame et al., Plant Physiol. 
2002 May;129(1):13-22). 

Transgenic plants made by these methods are grown in the greenhouse for T1 seed 
15 production. Inheritability and copy number of the transgene are checked by quantitative real- 
time PCR and Southern blot analysis and expression levels of the transgene are determined 
by reverse PCR and Northern analysis. Transgenic lines with single copy insertions of the 
transgene and with varying levels of transgene expression are selected for T2 seed 
production. 

20 

Progeny seeds are germinated and grown in the greenhouse in conditions well adapted for 
maize (16:8 photoperiod, 26-28°C daytime temperature and 22-24°C nighttime temperature) 
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as well under water-deficient, nitrogen-deficient, and excess NaCI conditions. Null segregants 
from the same parental line, as well as wild type plants of the same cultivar are used as 
controls. The progeny plants resulting from the selfing or the crosses are evaluated on different 
blomass and developmental parameters, including, plant height, stalk/stem thickness, stem 
5 size, number of leaves, total above ground area, leaf greenness, time to maturity, time to 
silking, flowering time, time to flower, ear number, ear length, row number, kernel number, 
kernel size , kernel oil content, grain maturity, harvesting time. The seeds of these lines are 
also checked on various parameters, such as grain size, total grain yield per plant, and grain 
quality (starch content, protein content and oil content). 

10 

Lines that are most significantly improved compared to corresponding control lines are 
selected for further field-testing and marker-assisted breeding, with the objective of transferring 
the field-validated transgenic traits into commercial germplasm. The testing of maize for 
growth and yield-related parameters In the field is conducted using well-established protocols. 
15 The corn plants are particularly evaluated on yield parameters, such as for example, amount of 
plants per acre, amount of ears per plant, amount of rows per ear, amount of seeds per row 
and TKW. Subsequent improvements for introgressing specific loci (such as transgene 
containing loci) from one germplasm into another is also conducted using well-established 
protocols. 

20 
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Claims 

1. Method for increasing plant yield relative to corresponding wild type plants, comprising 
modifying expression in a plant of a nucleic acid sequence encoding a 2xC2H2 zinc finger 

5 protein and/or modifying in a plant level and/or activity of a 2xC2H2 zinc finger protein. 

2. Method for increasing leaf surface area relative to corresponding wild type plants, 
comprising modffying expression in a plant of a nucleic acid sequence encoding a 2xC2H2 
zinc finger protein and/or modifying in a plant level and/or activity of a 2xC2H2 zinc finger 

10 protein. 

3. Method for prolonging vegetative growth phase of a plant relative to corresponding wild 
type plants, comprising modifying expression in a plant of a nucleic acid sequence 
encoding a 2xC2H2 zinc finger protein and/or modifying in a plant level and/or activity of a 

15 2xC2H2 zinc finger protein. 

4. Method according to any of claims 1 to 3. wherein said modifying expression, level and/or 
activity is effected by recombinant means and/or chemical means. 

20 5. Method according to any of claims 1 to 4, wherein said 2xC2H2 zinc finger protein 
comprises a QALGGH motif. 

6. Method according to any of claims 1 to 4, wherein said 2xC2H2 zinc finger protein 
comprises a NNM(W)QMH motif. 

25 

7. Method according to any of claims 1 to 6, wherein said 2xC2H2 zinc finger protein 
comprises an EAR motif. 

8. Method according to any of claims 1 to 7, wherein said 2xC2H2 zinc finger protein further 
30 comprises a B-box. 

9. Method according to any of claims 1 to 8, wherein said 2xC2H2 zinc finger protein further 
comprises an L-box. 

35 10. Method according to any of claims 1 to 9, wherein said 2xC2H2 zinc finger protein is 
derived from a dicotyledonous plant, preferably from the family Brassicaceae, further 
preferably from Arabldopsis thaliana, more preferably the nucleic acid is as represented by 
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SEQ ID NO 2 or a homologue, derivative or active fragment thereof and/or wherein said 
nucleic acid is as represented by SED ID NO 1 or a portion thereof or sequences capable 
of hybridising therewith . 

5 11. Method according to claim 10, wherein said homologue, derivative or active fragment has, 
in increasing order of preference, at least 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 
38%, 39%, 40%, 41%, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%. 52%, 54%. 
56%, 58%, 60%, 62%, 64%, 66%, 68%, 70%, 72%, 74%, 76%, 78%, 80%, 82%, 84%, 
86%, 88%. 90%, 92%, 94%, 96%, 98% sequence identity with the sequence of SEQ ID NO 
10 2. 

12. Method according to any of claims 1 to 1 1 , wherein said plant is a monocot. 

13. Method according to any of claims 1 to 12, wherein said modifying expression Is effected 
15 by Introducing into a plant a nucleic acid capable of modifying expression of a gene 

encoding a 2xC2H2 zinc finger protein and/or capable of modifying level and/or activity of a 
2xC2H2 zinc finger protein. 

14. Method according to claim 13, wherein said nucleic acid capable of modifying expression is 
20 a nucleic acid encoding a 2xC2H2 protein, such as a 2xC2H2 protein as defined in any of 

claims 5 to 11. 

15. Method according to claims 1 3 or 14, wherein said nucleic acid introduced into a plant is an 
alternative splice variant of a nucleic acid as defined in claim 14. 

25 

16. Method according to claims 13 or 15, wherein said nucleic acid introduced into a plant is an 
allelic variant of a nucleic acid as defined in claim 14. 

17. Method according to claims 13 or 16, wherein said nucleic acid introduced into a plant is 
30 comprised on at least part of a chromosome. 

18. Method according to any of claims 1 to 17, wherein said modifying expression comprises 
increased expression. 

35 19. Method according to any of claims 1 to 18, wherein expression of said nucleic acid is 
driven by a plant promoter, preferably a constitutive promoter, such as a GOS2 promoter. 
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20. Method according to any of claims 1 to 18, wherein expression of said nucleic acid is 
driven by a plant promoter, preferably a tissue preferred promoter, such as seed-preferred 
promoter. 

5 21. Method according to any of claims 1 to 20, wherein said increased yield comprises 
increased above ground biomass. 

22. Method according to any of claim 1 to 20, wherein said increased yield comprises 
increased seed yield. 

10 

23. Method according to any of claim 1 to 20, wherein said increased yield comprises 
increased root yield. 

24. Construct comprising: 

15 (i) A nucleic acid capable of modifying expression of a nucleic acid encoding a 

2xC2H2 zinc finger protein and/or capable of modifying level and/or activity of a 
2xC2H2 zinc finger protein; 
(ii) One or more plant control sequence capable of driving expression of the 
nucleic acid sequence of (i); and optionally 

20 (iii) A transcription termination sequence. 

25. Construct according to claim 24, wherein said nucleic acid of (i) is a nucleic acid as defined 
in any of claims 1 4 to 1 7. 

25 26. Construct according to claim 24 or 25, wherein said control sequences of (fi) is at least a 
constitutive promoter, such as a GOS2 promoter. 

27. Construct according to claim 24 or 25, wherein said control sequences of (ii) is at least a 
tissue preferred promoter, such as seed-preferred promoter. 

30 

28. Host cell comprising a construct according to any of claims 24 to 27. 

29. Method for the production of a transgenic plant having increased yield, increased leaf 
surface area and/or prolonged vegetative growth, which method comprises 

35 (i) introducing into a plant or plant cell a 2xC2H2 zinc finger nucleic acid; 

(ii) Cultivating the plant or plant cell under conditions promoting plant 
growth. 
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30. Plant obtainable by a method according to any of claims 1 to 23 and 29, which plant has 
increased yield, modified leaf surface area and/or prolonged vegetative growth, relative to 
corresponding wild type plants. 

5 

31. Transgenic plant having increased yield, increased leaf surface area and/or prolonged 
vegetative growth, which transgenic plant has modified expression of a nucleic acid 
encoding a 2xC2H2 zinc finger protein and/or modified level and/or activity of a 2xC2H2 
zinc finger protein, relative to corresponding wild type plants. 

10 

32. Plant part, preferably a harvestable part, a propagule or progeny of a plant as defined in 
claim 30 or 31, which progeny has modified expression of a nucleic acid encoding 2xC2H2 
zinc finger protein and/or modified level and/or activity of a 2xC2H2 zinc finger protein, 
relative to corresponding wild type plants. 

15 

33. Plant or plant part according to any of claims 30 to 32, which plant is a monocotyledonous 
plant, preferably a cereal. 

34. Plant or plant part according to any of claims 30 to 33 selected from rice, maize, wheat, 
20 barley, millet, oats, rye, sorghum, soybean, sunflower, canola, sugarcane, alfalfa, 

leguminosae (bean, pea), flax , lupinus, rapeseed, tobacco, tomato, potato, squash, 
papaya, poplar and cotton. 

35. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
25 construct as defined in any of claims 24 to 27 to increase plant yield. 

3G. A yield regulating composition comprising a nucleic acid encoding a 2xC2H2 protein, and/ 
or comprising a 2xC2H2 protein, and/or comprising a construct as defined In any one of 
claims 24 to 27. 

30 

37. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
construct as defined in any of claims 24 to 27 to increase leaf surface area. 

38. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
35 construct as defined in any of claims 24 to 27 to prolong vegetative growth. 
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39. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
construct as defined in any of claims 24 to 27 as target for an agrochemical. 

40. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
5 construct as defined in any of claims 24 to 27 in a breeding program. 

41. Use of a plant as defined in any of claims 30 to 34 to produce enzymes, pharmaceuticals 
or agrochemicals. 

10 42. Use of a plant as defined in any one of claims 30 to 34 to produce food or feed products. 
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SEQ ID NO 1: Arabidopsis thaliana SXZ cDNA (CDS1536) 

AATGGCGCTCGAGGCTCTTACATCACCAAGATTAGCTTCTCCGATTCCTCCTTTGTTCGAAG 
ATTCTTCAGTCTTCCATGGAGTCGAGCACTGGACAAAGGGTAAGCGATCTAAGAGATCAAGA 
TCCGATTTCCACCACCAAAACCTCACTGAGGAAGAGTATCTAGCTTTTTGCCTCATGCTTCT 
CGCTCGCGACAACCGTCAGCCTCCTCCTCCTCCGGCGGTGGAGAAGTTGAGCTACAAGTGTA 
GCGTCTGCGACAAGACGTTCTCTTCTTACCAAGCTCTCGGTGGTCACAAGGCAAGCCACCGT 
AAGAACTTATCACAGACTCTCTCCGGCGGAGGAGATGATCATTCAACCTCGTCGGCGACAAC 
CACATCCGCCGTGACTACTGGAAGTGGGAAATCACACGTTTGCACCATCTGTAACAAGTCTT 
TTCCTTCCGGTCAAGCTCTCGGCGGACACAAGCGGTGCCACTACGAAGGAAACAACAACATC 
AACACTAGTAGCGTGTCCAACTCCGAAGGTGCGGGGTCCACTAGCCACGTTAGCAGTAGCCA 
CCGTGGGTTTGACCTCAACATCCCTCCGATCCCTGAATTCTCGATGGTCAACGGAGACGACG 
AAGTCATGAGCCCTATGCCGGCGAAGAAGCCTCGGTTTGACTTTCCGGTCAAACTTCAACTT 
TAAGGAAATT 

SEQ ID NO 2: Arabidopsis thaliana STZ protein with annotation 
of the domains 

B-BOX L-BOX 
MAIiEALTSPRIASPIPPLF EDSSWHGVEHWTKGKRSKRSJ^ DFHHQNLT EEEYIAFCIiMLI, 
ARDNRQPPPPPA VEKLSYKlCSVCDKTFSSYQAIiGGEKASH| RKNIiSQTLSGGGDDHSTSSATT 
TSAVTTGSGKSHV |C^a^FPSGOAI^HKRCH| YEGNNNINTSSVSNSEGAGSTSHVSSSH 
RGFDiWIPPIPEFSMVNGDDEVMSPMPAKKPRFDFPVKLQL 
EAR motif 

SEQ ID NO 3: PRM3204 (sense, start codon in italics) 

5' GGGGACAAGTTTGTACAAAAAAGCAGGCTTCACA-arGGCGCTCGAGGCTC 3' 

SEQ ID NO 4: PRM3205 (reverse, complementary stop codon in 
italics) 

5' GGGGACCACTTTGTACAAGAAAGCTGGGTAATTTCCrrAAAGTTGAAGTTTGA 3' 
BOXES 

SEQ ID NO 5: QALGGH motif 
QALGGH 

SEQ ID NO 6: NNM motif 
NN (M/W) QMH 

SEQ ID NO 7: EAR motif 
hDLNh (X) P 

SEQ ID NO 8 : B-Box 
KR (S) KRXR 

SEQ ID NO 9: L-Box 
EXEXXAXCLXXL 

FIGURE 3 
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ORTHOLOGS OP STZ ±n OTHER. PLANT SPECIES 



SEQ ID NO 10: Dg_AF119050_l Datisca glomerata zinc-finger 
protein 1 (zfpl) mRNA, complete cds 

GGCACGAGGACAAATTCTCTCTCTATCCTCTGAATATCTTTGGTTTGTGAACTGAGAAGCTA 
TTAGATGGCTCTAGAAGCGCTCAACTCTCCGACCACAGCTACGCCGGTGTTTCACTACGACG 
ACCCCAGCTTGAATTACCTTGAGCCATGGACCAAGCGTAAGCGTTCCAAGCGTACGCGCTTA 
GATAGCCCCATACCGAGGAAGAGTACCTTGCTTTCTGCCTCATCATGCTCGCTCGTGGCCGC 
GTTGCCTCTGCAAATCGACGGGATTCTCAGTCTTCCATTCAGATTCAGCCTGAAGCAACGAC 
TTCGGCTACCAAAGTCAGTTATAAGTGCTCTGTGTGCGATAAGGCCTTTTCGTCTTATCAGG 
CTTTGGGTGGGCACAAGGCCAGCCACAGAAAGCTCGCTGGCGGCGAAGATCAATCGACTTCC 
TTTGCCACCACGAATTCAGCCACCGTCACTACCACCACAGCCTCCGGAGGTGGTGGCAGGTC 
TCATGAGTGTTCTATTTGCCACAAATCGTTCCCGACTGGCCAGGCCTTGGGTGGTCACAAGC 
GCTGCCACTACGAAGGCAGTATCGGCGGCAATAGTATTCACCACCACAACAATACCACCAAC 
AGCGGAAGCAACGGTGGCATGAGCATGACCTCCGAAGTAGGTTCCACACACACAGTCAGCCA 
CAGTCACCGTGACTTCGATCTCAACATCCCGGCCTTGCCGGAGTTTCGGTCGAATTTCTTCA 
TATCCGGGGATGACGAGGTCGAGAGTCCTCATCCGGCCAAGAAACCCCGTATATTGATGAAA 
TAAAACATTTCTCAAGATCACTGAACCAGGCTTTAGTTTCTTTATAGGAGGAGATTTAAAAA 
AGTAGTATCTCTCTTTCTTTATCCGTAGGATAATTAATATATTTCGTGTACATAAATTTGTA 
GTTCTTTAACACACTCTGTTTCATTTTCTTGCTTTGCTCAACTTTGTATTGGTTATTTCATT 
ATGAAAATTCAATT 

SEQ ID NO 11: Dg_AF119050_l Datisca glomerata, STZ ortholog, 
protein 

MALEALNSPTTATPVFHYDDPSLNYLEPWTKRKRSKRTRLDSPHTEEEYIiAFCLIMIjARGRV 
ASANRRDSQSSIQIQPEATTSATKVSYKCSVCDKAFSSYQALGGHKASHRK1AGGEDQSTSF 
ATTNSATVTTTTASGGGGRSHECSICHKSFPTGQALGGHKRCHYEGSIGGNSIHHHNNTTNS 
GSNGGMSMTSEVGSTHTVSHSHRDFDLNIPALPEFRSNFFISGDDEVESPHPAKKPRILMK 

SEQ ID NO 12:Gm_T09602_U68763.1_GMU68763 Glycine max (soybean) 

probable zinc finger protein SCOF-1 mRNA, complete cds 

AAAATTCTCACTCTCTCTCTCATCTCGAGATCATAGTATCATATTCAATATCATTTCATACC 

AAACACATGGCTTTGGAAGCTCTCAACTCACCAACAACAACCGCTCCATCTTTTCCCTTTGA 

CGACCCAACTATTCCATGGGCGAAACGAAAACGTTCAAAGCGTTCTCGCGACCATCCTICTG 

AAGAAGAGTACCTCGCCCTCTGCCTCATCATGCTCGCTCGCGGCGGCACCACCACCGTCAAC 

AACCGCCACGTCAGCCCTCCGCCGCTACAGCCACAGCCACAGCCGACACCAGATCCTTCCAC 

CAAGCTCAGTTACAAATGCTCCGTTTGCGACAAGAGCTTCCCCTCTTACCAAGCGCTCGGTG 

GACACAAGGCCAGTCACCGGAAACTCGCCGGCGCCGCCGAAGACCAACCCCCCAGCACCACC 

ACTTCCTCCGCCGCCGCCACCAGCTCCGCCTCCGGAGGTAAGGCCCATGAGTGCTCCATTTG 

CCACAAATCCTTCCCCACCGGACAGGCCCTTGGCGGACACAAACGTTGTCACTACGAAGGTA 

ACGGTAACGGAAATAACAACAACAGTAACAGCGTTGTCACCGTCGCCTCGGAAGGCGTGGGC 

TCCACCCACACTGTCAGTCACGGCCACCACCGCGACTTCGATCTCAACATCCCGGCCTTTCC 

GGATTTTTCGACCAAGGTCGGAGAAGACGAGGTTGAGAGCCCTCACCCTGTCATGAAGAAGC 

CTCGCCTCTTCGTCATTCCCAAGATCGAAATCCCCCAATTTCAATGAACTCGTTGAATTTTT 

AGTTTATTTTTCGACTATATATTTTGGAGAATTTTGAGAGTTACTATAATTTGATTTTGTAC 

ATAGTACTTGGAAGTTTTGTTGGACCGTACCGGACCCAGTTCTCTGGTTGAGGTTGTACTTT 

CACAACAGTGGCAGATTTGCAATTCAATTCAATTTATTTGTTTATTTTAAAAAAAAAAAAAA 

AAAA 
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SEQ ID NO 13: Gm_T09602 Glycine max (soybean) probable zinc 
finger protein SCOF-1, STZ otholog, protein 

MALEALNSPTTTAPSFPFDDPTIPWAKRKRSKRSRDHPSEEEYLALCLIMLARGGTTTVNNR 
HVSPPPLQPQPQPTPDPSTKLSYKCSVCDKSFPSYQA1GGHKASHRKLAGAAEDQPPSTTTS 
SAAATSSASGGKAHECSICHKSFPTGQALGGHKRCHYEGNGNGNNNNSNSWTVASEGVGST 
HTVSHGHHRDFDLNIPAFPDFSTKVGEDEVESPHPVMKKPRLFVIPKIEIPQFQ 

SEQ ID NO 14: Ms_CAB77055_ Y18788 . 1_MSY18788 Medicago sativa 

putative TFIIIA (or kruppel) -like zinc finger protein mRNA 

AATTCGGCACGAGAAATAACCACTTCTCTCTCAAAACCTCCTTTTGCCTTTTGCTTCTACTT 

TCACTTGCGTAACGCTAACTAACTCTTCTCGAGTGTTCTTCTTTTCATCATATGGCTATGGA 

AGCACTTAACTCACCCACCACTGCTACTCCTTTCACACCCTTTGAGGAACCAAATCTGAGTT 

ATCTTGAAACACCGTGGACGAAAGGTAAACGATCAAAGCGTTCTCGCATGGATCAATCTTCA 

TGCACTGAAGAAGAGTATCTCGCTCTTTGTCTCATCATGCTTGCTCGCAGCGGTAACAACAA 

CGACAAAAAGTCTGATTCGGTGGCGACGCCGCTAACCACCGTTAAACTCAGTCACAAATGCT 

CAGTCTGCAACAAAGCTTTCTCATCTTATCAAGCCCTAGGTGGACACAAAGCCAGTCACCGG 

AAAGCTGTTATGTCCGCAACCACCGCTGAAGATCAGATCACCACCACTTCATCCGCCGTGAC 

TACCAGCTCTGCTTCCAACGGTAAGAACAAGACTCATGAGTGTTCCATCTGTCACAAATCCT 

TCCCTACTGGACAGGCTTTGGGAGGACACAAGCGTTGTCACTACGAAGGCAGCGTTGGTGCC 

GGTGCCGGTGCTGGAAGTAACGCTGTAACTGCCTCTGAAGGAGTTGGATTGTCACACAGCCA 

CCACCGTGATTTTGATCTTAACCTCCCGGCTTTTCCGGACTTTTCAAAGAAGTTTTTCGTGG 

ATGACGAGGTTTTTAGTCCTTTACCTGCTGCAAAGAAGCCCTGTCTTTTCAAGCTGGAAATT 

CCTTCTCATTACTGATCAATAATAGATCCAATTTTATTGTTATTATTATTAATAATTATTAT 

CGCTTAGGGCATAGTTATTTTCTTTTTTCTTTCAATTATTTCGGATCAATTTGTTCTGTACA 

TACAAATTGGGATTGGTTTTAGAATTTAGGACGGTTGTAGACAATGGAAATTCAATTCAATT 

ATTTAATTTTGTGT 

SEQ ID NO 15: Ms_CAB77055_Medicago sativa putative TFIIIA (or 

kruppel) -like zinc finger protein, STZ otholog, protein 

MAMEALNSPTTATPFTPFEEPNLSYLETPWTKGKRSKRSKMDQSSCTEEEYLAIjCLIMIjARS 

GNNNDKKSDSVATPLTTV1CLSHKCSVCNKAFSSYQALGGHKASHKKAVMSATTAEDQITTTS 

SAVTTSSASNGKNKTHECSICHKSFPTGQALGGHKRCHYEGSVGAGAGAGSNAVTASEGVGL 

SHSHHRDFDLNLPAFPDFSKKFFVDDEVFSPLPAAKKPCLFKLEIPSHY 

SEQ ID NO 16: Nt_AAC06243_ AF053077 Nicotiana tabacum osmotic 

stress-induced zinc-finger protein (zfp) mRNA, complete cds 

TTTTCCCTCGAATTTGATAACTAAAGAGAATATTATGACTCTTGAAGCTTTGAAGTCACCTA 

CGGCGGCAACGCCGACTCTACCACCACGCTAraAAGATGATGATGAAATTCATAATTTGGAT 

TCTTGGGCTAAAGGAAAACGATCAAAACGGCCCCGTATTGATGCCCCACCGACTGAAGAAGA 

GTATTTAGCCCTCTGTCTCATCATGCTCGCTCGCAGCGGAACCGGAACCAGAACCGGTTTAA 

CTGATGCTACTACTTCCCAACAACCTGCCGATAAAAAAACCGCCGAGTTGCCGCCGGTTCAT 

AAGAAAGAGGTGGCAACAGAGCAAGCAGAGCAATCTTACAAGTGTAGCGTGTGTGACAAGGC 

TTTTTCTTCTTATCAAGCACTCGGTGGGCATAAAGCAAGTCACCGTAAAACTACTACTACTG 

CTACCGCCGCCTCTGATGATAACAATCCTTCAACTTCAACTTCCACTGGCGCCGTTAATATC 

TCTGCTCTTAATCCAACTGGTCGTTCACACGTCTGTTCTATTTGCCACAAGGCTTTTCCTAC 

TGGCCAAGCTTTGGGTGGGCACAAGCGCCGCCACTATGAAGGCAAACTCGGTGGTAACAGCC 

GCGACTTAGGCGGCGGCGGCGGCGGCGGTCATAGTGGAAGCGTCTTGACTACTTCAGACGGC 
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GGCGCGTCGACTCACACGCTACGTGACTTTGACCTGAACATGCCTGCTTCGCCGGAATTGCA 
ACTGGGTCTGAGTATTGATTGTGGACGGAAAAGTCAACTGTTGCCGATGGTCCAAGAGGTGG 
AAAGTCCTATGCCTGCAAAGAAACCGCGTTTATTGTTTTCGTTGGGTTCaAACTTCTTTAGG 
GGAATTGAATTGATTGTGTTTTAGCCAAATTAGTAAATTGGTTCATGTGATTTTATTTTTAG 
GAAAAGGAATTATTGATTGTTTTACCCGTTTATTCTTAGGGTGGTATTATGTACAGGGAGTG 
AATCATTCATTGGTTTTACACTTTCTTAATTATATATTCTTTTTTTTTACACATAAAAAAAA 
AAAAAAA 

SEQ ID NO 17: Nt_AAC06243_Nicotiana tabacum osmotic stress- 
induced zinc-finger protein, STZ otholog, protein 
MTLEALKSPTAATPTLPPRYEDDDEIHNLDSWAKGKRSKRPRIDAPPTEEEYLALCLIMLAR 
SGTGTRTGLTDATTSQQPADKKTAELPPVHKKEVATEQAEQSYKCSVCDKAFSSYQALGGHK 
ASHRKTTTTATAASDDNNPSTSTSTGAVNISALNPTGRSHVCSICHKAFPTGQALGGHKRRH 
YEGKLGGNSRDLGGGGGGGHS GS VLTT S DGGASTHTLRDFDLNMPASPELQLGL S I DCGRKS 
QLLPMVQEVE S PMP AKKPRLL FS LG 

SEQ ID NO 18: Os_AF332876 Oryza sativa zinc finger 
transcription factor ZF1 mRNA, complete cds 

AATTCGGCACGAGGCCACACAGCAACCAGCCAGCTGCCACACTAGCTTGAGGCGAGCGAGCG 
AAGCTTAGCTAGCGGATAGAACAAGTCGTCGATCTGCTTGCTGCTTTTGTGAATTGCGGTGG 
AAGCATGTCGAGCGCGTCGTCCATGGAAGCGCTCCACGCCGCGGTGCTCAAGGAGGAGCAGC 
AGCAGCACGAGGTGGAGGAGGCGACGGTCGTGACGAGCAGCAGCGCCACGAGCGGGGAGGAG 
GGCGGACACCTGCCCCAGGGGTGGGCGAAGCGGAAGCGGTCGCGCCGCCAGCGATCGGAGGA 
GGAGAACCTCGCGCTCTGCCTCCTCATGCTCGCCCGCGGCGGCCACCACCGCGTCCAGGCGC 
CGCCTCCGCTCTCGGCTTCGGCGCCCCCGCCGGCAGGTGCGGAGTTCAAGTGCTCCGTCTGC 
GGCAAGTCCTTCAGCTCCTACCAGGCGCTCGGCGGCCACAAGACGAGCCACCGGGTCAAGCT 
GCCGACTCCGCCCGCAGCTCCCGTCTTGGCTCCCGCCCCCGTCGCCGCCTTGCTGCCTTCCG 
CCGAGGACCGCGAGCCAGCCACGTCATCCACCGCCGCGTCCTCCGACGGCAXGACCAACAGA 
GTCCACAGGTGTTCCATCTGCCAGAAGGAGTTCCCCACCGGGCAGGCGCTCGGCGGGCACAA 
GAGGAAGCACTACGACGGTGGCGTAGGCGCCGGCGCCGGCGCATCTTCAACCGAGCTCCTGG 
CCACGGTGGCCGCCGAGTCCGAGGTGGGAAGCTCCGGCAACGGCCAGTCCGCCACCCGGGCG 
TTCGACCTCAACCTCCCGGCCGTGCCGGAGTTCGTGTGGCGGCCGTGCTCCAAGGGCAAGAA 
GATGTGGGACGAGGAGGAGGAGGTCCAGAGCCCCCTCGCCTTCAAGAAGCCCCGGCTTCTCA 
CCGCGEaATTCAGCAGCTGCACGGATCCGATCCGTCAGAGTTTTTGTCTAGGGAGTGAAATT 
CAGTCGAAACACACTATTCGTTGATTCGTTTTGTGCCGCTATTGTTTAATTTGTTCCTGCTT 
TTGTACAGAGCAAGCGAGTGATACATAGCCATACATACAGTCATACAGATATAGGTCTAGCT 
CTTCCTTGGTTCTTTGTAACACTGGAACTGTACCTGTATCTTTTACACTTTGTTCTTTGACA 
GTCATATATTGTAGACCAAAAAAAAAAAAAAAAAA 

SEQ ID NO 19: Os_AF332876 Oryza sativa seedling zinc finger 
transcription factor ZF1, STZ otholog, protein 

MS SASSMEALHAAvXKEEQQQHEVEEATVVT s s S ATS GEEGGH1PQGWAKRKRSRRQRSEEE 
NIiALCLLMIiARGGHHRVQ AP P PL S AS APP PAGAEFKC SVC GKS F S S YQ ALGGHKT S HRVKLP 
TPPAAPVLAPAPVAALLPSAEDREPATSSTAASSDGMTNRVHRCSICQKEFPTGQALGGHKR 
KHYDGGVGAGAGASSTELIATVAAESEVGSSGNGQSATRAFDLNLPAVPEFVWRPCSKGKKM 
WDEEEEVQS P LAFKKPRLLT A 
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SEQ ID NO 20: Ph_BAA05079_D26086 .1 [Petunia x hybrida] . 
PETZFP4 zinc- finger protein gene ■ 

TTCACTCACCAAAACAACTTCTCTACCTCTTCTACTTGCACATTCAAATTCTTTCATTACTA 
CTTATCTCTACTAATCTTGATTCGATTTTAGTAAATCAAACAAGAGAATCTTTTCAGTAATA 
CAAACAAGAAAATTTTCTCTCTATACTTGATTGAGTTTAGTAAGGCAAACAAGAAAACTATC 
ATGGCACTTGAAGCATTGAATTCTCCAACTACAACAACACCACCATCATTCCAATTTGAGAA 
CAACGGGCTTAAGTACCTTGAGAGTTGGACAAAAGGTAAAAGATCAAAAAGGCAACGCAGCA 
TGGAACGACAGTGTACTGAAGAAGAGTATTTAGCACTTTGTCTTATCATGCTAGCACGTAGC 
GATGGTTCTGTTAATAACTCACGGTCTCTACCACCACCACCACTACCACCATCAGTTCCAGT 
AACGTCGCAAATAAACGCGACGTTATTGGAACAGAAGAATTTGTACAAGTGTTCCGTTTGTG 
GTAAAGGGTTTGGGTCTTATCAAGCTTTAGGTGGACATAAAGCAAGTCACCGGAAACTTGTC 
AGCATGGGAGGAGATGAACAATCTACTACTTCCACTACTACTAACGTAACGGGAACTAGTTC 
CGCTAACGTTAACGGTAACGGAAGAACTCACGAATGTTCAATTTGTCACAAGTGCTTTCCTA 
CTGGACAAGCTTTAGGTGGTCATAAAAGGTGCCACTATGACGGTGGTAACGGTAACGGTAAC 
GGAAGTGTAAGTGTTGGGGTGACGTCATCTGAAGGTGTGGGGTCCACTATTAGTCATCACCG 
TGACTTTGACTTGAATATTCCCGCGTTGCCGGAGTTTTGGCCGGGATTTGGTTCCGGCGAGG 
ATGAGGTGGAGAGTCCTCATCCAGCAAAGAAGTCAAGGCTATCTCTTCCACCTAAACTTGAA 
TTATTCAAAGGATTATAGAGGGAATATTGATTTGTTACAGGAAGATTTATTAGGATTCACGA 
ATTTTTTGTTGACTAGTTTATGTAATAT 

SEQ ID NO 21: Ph_BAA05079 [Petunia x hybrida] zinc-finger 
protein, STZ otholog, protein 

MALEALNSPTTTTPPSFQFENNGIJCYLESWTKGKRSKRQRSMERQCTEEEYLALCLIMIiARS 
DGSVNNSRSLPPPPLPPSVPVTSQINATLLEQKNLYKCSVCGKGFGSYQALGGHKASHRKLV 
SMGGDEQSTTSTTTNVTGTSSANVNGNGRTHECSICHKCFPTGQALGGHKRCHYDGGNGNGN 
GSVS VGVTSSEGVGST I SHHRDFDLNI PALPEFWPGFGSGEDEVESPHPAKKSRLSLPPKLE 
LFKGL 

SEQ ID NO 22: Ta_BAA03901 Triticum aestivum gene for zinc- 
finger protein WZF1, complete cds 

ATGTCGTCGTCGGCCATGGAAGCGCTCCACGCCCTGATCCCGGAGCAGCACCAGCTGGACGT 
TGAGGCGGCTGCGGCTGTCAGCAGCGCCACCAGCGGCGAGGAGAGCGGCCACGTGCTGCAGG 
GGTGGGCCAAGAGGAAGCGATCGCGCCGCCAGCGCTCCGAGGAGGAGAACCTCGCGCTCTGC 
CTCCTCATGCTCTCGCGCGGCGGCAAGCAGCGTGTTCAGGCGCCGCAGCCGGAGTCGTTCGC 
TGCGCCGGTGCCTGCCGAGTTCAAGTGCTCCGTCTGCGGCAAGTCCTTCAGCTCCTACCAGG 
CGCTCGGAGGCCACAAGACGAGCCACCGGGTGAAGCAGCCGTCTCCTCCCTCTGATGCCGCT 
GCTGCCCCACTCGTGGCCCTCCCGGCCGTCGCCGCCATCCTGCCGTCCGCCGAGCCGGCCAC 
GTCGTCCACCGCCGCGTCCTCCGACGGCGCGACCAACAGAGTCCACAGGTGCTCCATCTGCC 
AAAAGGAGTTCCCGACTGGGCAGGCGCTCGGCGGGCACAA.GAGGAAGCACTACGACGGAGGC 
GTGGGCGCCGCCGCCTCGTCGACCGAGCTTCTGGCCGCCGCGGCCGCCGAGTCTGAGGTGGG 
GAGCACCGGCAACGGGAGCTCCGCCGCCCGGGCCTTCGACCTGAACATTCCGGCCGTGCCGG 
AGTTCGTGTGGAGGCCGTGCGCCAAGGGCAAGATGATGTGGGAGGACGATGAGGAGGTGCAG 
AGCCCCCTCGCCTTCAAGAAGCCTCGGCTTCTCACCGCTTGA 
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SEQ ID NO 23: Ta_BAA03901_WZFl Triticum aestivum, STZ otholog, 
protein 

MSSSAMEALH^IPEQHQLDVEAAAAVSSATSGEESGHVLQGWAKRKRSRRQRSEEENIALC 
LLMLSRGGKQRVQAPQPESFAAPVPAEFKCSVCGKSFSSYQALGGHKTSHRVKQPSPPSDAA 
AAPLVALPAVAAILPSAEPATSSTAASSDGATNRVHRCSICQKEFPTGQALGGHKRKHYDGG 
VGAAASSTELLAAAAAESEVGSTGNGSSAARAFDLNIPAVPEFVWRPCAKGKMMWEDDEEVQ 
SPLAFKKPRLLTA 

SEQ ID NO 24: Ca AF539746 Capsicum annum zinc finger protein 
mRNA, complete cds 

AAAATCTTCGCTACTTACTTACATCTTCTAGAATAGTCACTAGAACCAGTAACTTTATACAA 
CGGATATCGATArflGCACTTGAAGCTTTGAATTCTCCAACTGGTACACCAACTCCGCCACCG 
TTTCAATTTGAGAGCGACGGCCAACAGCTTCGATATATCGAAAACTGGAGGAAGGGAAAGAG 
ATCTAAAAGGTCACGCAGCATGGAGCACCAGCCTACTGAGGAAGAATACTTAGCGCTTTGTT 
TGATCATGCTTGCACGTAGCGGTGGCTCCGTTAATCATCAACGATCTCTACCACCGCCGGCT 
CCGGTGATGAAACTGCACGCGCCGTCGTCATCATCGGCGGCGGAGGAGGAGAAGGAGAAGAT 
GGTGTATAAGTGTTCGGTTTGTGGTAAGGGATTTGGGTCTTATCAAGCTTTAGGTGGACACA 
AAGCTAGTCACCGGAAACTCGTACCCGGCGGAGATGATCAGTCAACTACCTCCACAACCACT 
AACGCAACCGGAACAACAACCTCCGTTAACGGCAACGGCAACAGAAGTGGAAGGACTCACGA 
GTGTTCGATTTGTCACAAGTGTTTTCCCACTGGACAAGCTTTAGGTGGACACAAAAGGTGTC 
ACTACGACGGCGGTATCGGTAACGGAAACGCTAACAGTGGCGTTAGTGCTAGCGTTGGAGTG 
ACGTCATCGGAGGGTGTGGGGTCCACAGTCAGTCACCGGGATTTCGACTTGAACATTCCGGC 
GTTGCCGGAATTCTGGCTGGGATTTGGTICCGGCGAAGATGAGGTGGAGAGTCCACATCCGG 
CGAAGAAATCGCGGTTATGTTTGCCTCCAAAATATGAATTATTTCAACATMATGGGAATTT 
GATTGTTAGGATTTACTATTTTGGTAGACAAAATTATACTATGTAAGTTTTAATTTTCATTG 
TGGGTGGGAGCAAAATTTTTAATTTTTTGTCTATAGACCTAGCTAGTTACTAATAGCAAAAA 
TTCAATTGATTGATTTAAAAAAAAAAAAAAAAAA 

SEQ ID NO 25: Ca AF539746-capsicum annum, STZ otholog, protein 
MALEALNSPTGTPTPPPFQFESDGQQLRYIENWRKGKRSKRSRSMEHQPTEEEYLALCL1ML 
ARSGGSVNHQRSLPPPAPVMKLHAPSSSSAAEEEKEKMVYKCSVCGKGFGSYQALGGHKASH 
RKLVPGGDDQSTTSTTTNATGTTTSVNGNGNRSGRTHECSICHKCFPTGQALGGHKRCHYDG 
GIGNGNANSGVSASVGVTSSEGVGSTVSHRDFDLNIPALPEFWLGFGSGEDEVESPHPAKKS 
RLCLPPKYELFQH 

PARALOGS OF STZ IN ARABIDOPSIS THALIANA 

SEQ ID NO 2S: gi_18402298_ref_NM_112848 . 1 mRNA 

ACTTCACTCTCTAATTTCCTTCTCTCTATCTCTCACCATATTCGCGATTAAAAACTCTCAAC 
TTTTCTCTCAAATTTCTGATCCTTTGATCCAACAGTTAGAAGAAGATTCATCTGATCATGGC 
CCTCGAAGCGATGAACACTCCAACTTCTTCTTTCACCAGAATCGAAACGAAAGAAGATTTGA 
TGAACGACGCCGTTTTCATTGAGCCGTGGCTTAAACGCAAACGCTCCAAACGTCAGCGTTCT 
CACAGCCCTTCTTCGTCTTCTTCCTCACCGCCTCGATCTCGACCCAAATCCCAGAATCAAGA 
TCTTACGGAAGAAGAGTATCTCGCTCTTTGTCTCCTCATGCTCGCTAAAGATCAACCGTCGC 
AAACGCGATTTCATCAACAGTCGCAATCGTTAACGCCGCCGCCAGAATCAAAGAACCTTCCG 
TACAAGTGTAACGTCTGTGAAAAAGCGTTTCCTTCCTATCAGGCTTTAGGCGGTCACAAAGC 
AAGTCACCGAATCAAACCACCAACCGTAATCTCAACAACCGCCGATGATTCAACAGCTCCGA 
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CCATCTCCATCGTCGCCGGAGAAAAACATCCGATTGCTGCCTCCGGAAAGATCCACGAGTGT 
TCAATCTGTCATAAAGTGTTTCCGACGGGTCAAGCTTTAGGCGGTCACAAACGTTGTCACTA 
CGAAGGCAACCTCGGCGGCGGAGGAGGAGGAGGAAGCAAATCAATCAGTCACAGTGGAAGCG 
TGTCGAGCACGGTATCGGAAGAAAGGAGCCACCGTGGATTCATCGATCTAAACCTACCGGCG 
TTACCTGAACTCAGCCTTCATCACAATCCAATCGTCGACGAAGAGATCTTGAGTCCGTTGAC 
CGGTAAAAAACCGCTTTTGTTGACCGATCACGACCAAGTCATCAAGAAAGAAGATTTATCTT 
TAAAAATCTAATACTCGACTATTAATTCTTGTGTGATTTTTTTCGTTACAACCATAGTTTCA 
TTTTCATTTTTTTAGTTACAAATTTTTAATTGTTCTGATTTGGATTGAATATTGGTATATTG 
TTAGGGGTTGATAC 

SEQ ID NO 27: Translation of gi_18402298_ref_NM_112848 . 1_ 
MALEAMNTPTS S FTRIETKEDLMNDAVFIEPWLKRKRSKRQRSHSP SS SS SS PPRSRPKSQN 
QDLTEEEYLALCLLMLAKDQPSQTRFHQQSQSLTPPPESKNLPYKCNVCEKAFPSYQALGGH 
KASHRIKPPT VI STTADDST APTIS I VAGEKHPIAAS GKI HECS ICHKVFPTGQALGGHKRC 
HYEGNLGGGGGGGSKSISHSGSVSSTVSEERSHRGFIDLNLPALPELSLHHNPIVDEEILSP 
LTGKKPLLLTDHDQVIKKEDLSLKI 

SEQ ID NO 28: gi_30680473_ref_NM_120516 . 3_mRNA 

AAATCAAATCTTTTCATTTACAATTATCTTTCTTCTCAATTTAGAACTTAGTAGCTAGTCTT 
CAAGATAATGGCACTTGAAACTCTTACTTCTCCAAGATTATCTTCTCCGATGCCGACTCTGT 
TTCAAGATTCAGCACTAGGGTTTCATGGAAGCAAAGGCAAACGATCTAAGCGATCAAGATCT 
GAATTCGACCGTCAGAGTCTCACGGAGGATGAATATATCGCTTTATGTCTCATGCTTCTTGC 
TCGCGACGGAGATAGAAACCGTGACCTTGACCTGCCTTCTTCTTCGTCTTCACCTCCTCTGC 
TTCCTCCTCTTCCTACTCCGATCTACAAGTGTAGCGTCTGTGACAAGGCGTTTTCGTCTTAC 
CAGGCTCTTGGTGGACACAAGGCAAGTCACCGGAAAAGCTTTTCGCTTACTCAATCTGCCGG 
AGGAGATGAGCTGTCGACATCGTCGGCGATAACCACGTCTGGTATATCCGGTGGCGGGGGAG 
GAAGTGTGAAGTCGCACGTTTGCTCTATCTGTCATAAATCGTTCGCCACCGGTCAAGCTCTC 
GGCGGCCACAAACGGTGCCACTACGAAGGAAAGAACGGAGGCGGTGTGAGTAGTAGCGTGTC 
GAATTCTGAAGATGTGGGGTCTACAAGCCACGTCAGGAGTGGCCACCGTGGGTTTGACCTCA 
ACATACCGCCGATACCGGAATTCTCGATGGTCAACGGAGACGAAGAGGTGATGAGTCCTATG 
CCGGCGAAGAAACTCCGGTTTGACTTCCCGGAGAAACCCTAAACATAAACCTAGGAAAAACT 
TTACAGAATTCATTTTATAGGAAATTGTTTTACTGTATATACAAATATCGATTTTGATTGAT 
GTTCTTCTTCACTGAAAAATTATGATTCTTTGTTGTATAATTGATGTTTCTGAAAAAGATAT 
AACTTTTTATTGTTTCACACGTATCAAAATTTGCTTGGATACATCA 

SEQ ID NO 29: Translation of gi_30680473_ref_NM_120516 . 3_ 
MALETLTSPRLSSPMPTLFQDSALGFHGSKGKRSKRSRSEFDRQSLTEDEYIALCLMLLARD 
GDRNRDLDLPSSSSSPPLLPPLPTPIYKCSVCDKAFSSYQALGGHKASHRKSFSLTQSAGGD 
ELSTSSAITTSGISGGGGGSVKSHVCSICHKSFATGQALGGHKRCHYEGKNGGGVSSSVSNS 
EDVGSTSHVS SGHRGFDLNI PPI PE FSMVNGDEEVMS PMP AKKLRFD FPEKP 
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SEQ ID NO 30: gi 30693252_ref_NM_114853. 2_mRNA 

ATGGCTCTCGACACTCTCAATTCTCCCACCTCCACCACCACAACCACCGCTCCTCCTCCTTT 
CCTCCGTTGCCTCGACGAAACCGAGCCCGAAAACCTCGAATCATGGACCAAAAGAAAACGTA 
CAAAACGTCACCGTATAGATCAACCAAACCCTCCTCCTTCTGAAGAAGAGTATCTCGCTCTT 
TGCCTCCTTATGCTCGCTCGTGGCTCCTCCGATCATCACTCTCCACCGTCGGATCATCACTC 
TCTTTCTCCACTGTCCGATCATCAGAAAGATTACAAGTGTTCCGTCTGTGGCAAATCTTTCC 
CGTCTTACCAAGCGTTAGGTGGACACAAAACAAGTCACCGGAAACCGGTTAGTGTCGATGTT 
AATAATAGTAACGGAACCGTTACTAATAACGGAAATATTAGTAACGGTTTAGTTGGTCAAAG 
TGGGAAGACTCATAACTGCTCTATATGTTTTAAGTCGTTTCCCTCTGGTCAAGCATTGGGTG 
GTCACAAACGTTGTCACTATGATGGTGGTAACGGTAACAGTAACGGTGACAATAGCCACAAG 
TTTGACCTAAATTTACCGGCTGATCAAGTTAGTGATGAGACAATTGGAAAAAGTCAACTCTC 
CGGTGAAGAAACAAAGTCGGTGTTGTGATTATTATTATTTTTTACCGATCGGGATTAGCTAG 
TGGTTGATCATTAGCTGAGTCTGTAATGAAAATGAT 

SEQ ID NO 31: Translation of gi_30693252_ref_NM_114853 .2 
MALDTLNSPTSTTTTTAPPPFLRCLDETEPENLESWTKRKRTKRHRIDQPNPPPSEEEYLAL 
CLLMLARGSSDHHSPPSDHHSLSPLSDHQKDYKCSVCGKSFPSYQALGGHKTSHRKPVSVDV 
NNSNGTVTNNGNISNGLVGQSGKTHNCSICFKSFPSGQALGGHKRCHYDGGNGNSNGDNSHK 
FDLNLPADQVS DE T I GKS QLS GEE TKS VL 

SEQ ID NO 32:. gi_30694224_ref_NM_123683 .2_mRNA 

AAATTTTCTATAGCAATGGCGCTTGAAGCTCTTAATTCACCAAGATTGGTCGAGGATCCCTT 
AAGATTCAATGGCGTTGAGCAGTGGACCAAATGTAAGAAACGATCCAAACGTTCGAGATCTG 
ATCTTCATCATAACCACCGTCTCACTGAGGAAGAGTATCTAGCTTTCTGTCTCATGCTTCTT 
GCTCGGGATGGCGGCGATCTTGACTCTGTGACGGTTGCGGAGAAGCCGAGTTATAAGTGTGG 
CGTTTGTTACAAGACGTTTTCGTCTTACCAAGCTCTCGGCGGTCATAAAGCGAGCCACCGGA 
GCTTATACGGTGGTGGAGAGAATGATAAATCGACACCATCCACCGCCGTGAAATCTCACGTT 
TGTTCGGTTTGCGGGAAATCTTTCGCCACCGGTCAAGCTCTCGGCGGCCACAAGCGGTGCCA 
CTACGATGGTGGCGTTTCGAACTCGGAAGGTGTGGGGTCTACTAGCCACGTCAGCAGTAGTA 
GCCACCGTGGATTTGACCTTAATATTATACCGGTGCAGGGATTTTCGCCGGACGACGAAGTG 
ATGAGTCCGATGGCGACTAAGAAGCCTCGCCTGAAGTAAGTCTTTGTTGAAGACCTGGAAGT 
TTATCAAATGTAAATATCAAATTTCAATTTCAAGGAACAGTTTTGTTGATTCTATTACCAAT 
ACACAATACGATTCAATTCC 

SEQ ID NO 33: Translation of gi_30694224_ref_NM_123683 . 2_ 

MALEALNSPRLVEDPLRFNGVEQWTKCKKRSKRSRSDLHHNHRLTEEEYLAFCLMLLARDGG 

DLDSVTVAEKPSYKCGVCYKTFSSYQALGGHKASHRSLYGGGENDKSTPSTAVKSHVCSVCG 

KSFATGQALGGHKRCHYDGGVSNSEGVGSTSHVSSSSHRGFDLNIIPVQGFSPDDEVMSPMA 

TKKPRLK 
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SEQ ID NO 34: gi_30698307_ref_NM_126145 . 2_mRNA 

CACACTTCACTCTTTCTTCATCTTCTTCTTCTTAAATAGCTCGAAATCACATCTCACAGAAT 
TAAA.TCTTATGGCTCTCGAGACTCTCAATTCTCCAA.CAGCTACCACCACCGCTCGGCCTCTT 
CTCCGGTATCGTGAAGAAATGGAGCCTGAGAATCTCGAGCAATGGGCTAAAAGAAAACGAAC 
AAAACGTCAACGTTTTGATCACGGTCATCAGAATCAAGAAACGAACAAGAACCTTCCTTCTG 
AAGAAGAGTATCTCGCTCTTTGTCTCCTCATGCTCGCTCGTGGCTCCGCCGTACAATCTCCT 
CCTCTTCCTCCTCTACCGTCACGTGCGTCACCGTCCGATCACCGAGATTACAAGTGTACGGT 
CTGTGGGAAGTCCTTTTCGTCATACCAAGCCTTAGGTGGACACAAGACGAGTCACCGGAAAC 
CGACGAACACTAGTATCACTTCCGGTAACCAAGAACTGTCTAATAACAGTCACAGTAACAGC 
GGTTCCGTTGTTATTAACGTTACCGTGAACACTGGTAACGGTGTTAGTCAAAGCGGAAAGAT 
TCACACTTGCTCAATCTGTTTCAAGTCGTTTGCGTCTGGTCAAGCCTTAGGTGGACACAAAC 
GGTGTCACTATGACGGTGGCAACAACGGTAACGGTAACGGAAGTAGCAGCAACAGCGTAGAA 
CTCGTCGCTGGTAGTGACGTCAGCGATGTTGATAATGAGAGATGGTCCGAAGAAAGTGCGAT 
CGGTGGCCACCGTGGATTTGACCTAAACTTACCGGCTGATCAAGTCTCAGTGACGACTTCTX 
AACGTTGACTGAGTTTGAGGAAAAAGTCAACTATCAAGCGAAGAAAGGGTTAGTGGACGGTG 
AAGATTAACGGTCGTTTCTTTCCAGTTGCTTCGGTTTGAGCTTGACTGGGTCTGTAATGAAA 
ATGATTGGAGTGGACTTGGCATTATTATTATTATTTTTAAAAAGAAATGTTAATTTGTTGTT 
GGATTTGTTTATAGATAGAGGAAACAATTGGGATACACAAATATTTTTTTTTTTTACAAAGA 
AAATAATAATGCAGAGATGGATGATTGGATCGTACACGTTATTATATAGTGGACCATTCTGT 
AATCGTGAATTATTATTATTTGTTAGAAATTTAATTTTCGT 

SEQ ID NO 35: Translation of gi_30698307_ref_NM_126145 . 2_ 

MALETLNSPTATTTARPLLRYREEMEPENLEQWAKRKRTKRQRFDHGHQNQETNKNLPSEEE 

YLALCL1MIARGSAVQSPPLPPLPSRASPSDHRDYKCTVCGKSFSSYQALGGHKTSHRKPTN 

TSITSGNQELSNNSHSNSGSWINVTVNTGNGVSQSGKIHTCSICFKSFASGQALGGHKRCH 

YDGGNNGNGNGSSSNSVELVAGSDVSDVDNERWSEESAIGGHRGFDLNLPADQVSVTTS 

OTHER GENES IN EVALUATION 

SEQ ID NO 36: gi_ 12698881_ref_ AF_332876 . 1 jmRNA, 2xC2H2, 
Oryza sativa 

AATTCGGCACGAGGCCACACAGCAACCAGCCAGCTGCCACACTAGCTTGAGGCGAGCGAGCG 
AAGCTTAGCTAGCGGATAGAACAAGTCGTCGATCTGCTTGCTGCTTTTGTGAATTGCGGTGG 
AAGCATGTCGAGCGCGTCGTCCATGGAAGCGCTCCACGCCGCGGTGCTCAAGGAGGAGCAGC 
AGCAGCACGAGGTGGAGGAGGCGACGGTCGTGACGAGCAGCAGCGCCACGAGCGGGGAGGAG 
GGCGGACACCTGCCCCAGGGGTGGGCGAAGCGGAAGCGGTCGCGCCGCCAGCGATCGGAGGA 
GGAGAACCTCGCGCTCTGCCTCCTCATGCTCGCCCGCGGCGGCCACCACCGCGTCCAGGCGC 
CGCCTCCGCTCTCGGCTTCGGCGCCCCCGCCGGCAGGTGCGGAGTTCAAGTGCTCCGTCTGC 
GGCAAGTCCTTCAGCTCCTACCAGGCGCTCGGCGGCCACAAGACGAGCCACCGGGTCAAGCT 
GCCGACTCCGCCCGCAGCTCCCGTCTTGGCTCCCGCCCCCGTCGCCGCCTTGCTGCCTTCCG 
CCGAGGACCGCGAGCCAGCCACGTCATCCACCGCCGCGTCCTCCGACGGCATGACCAACAGA 
GTCCACAGGTGTTCCATCTGCCAGAAGGAGTTCCCCACCGGGCAGGCGCTCGGCGGGCACAA 
GAGGAAGCACTACGACGGTGGCGTAGGCGCCGGCGCCGGCGCATCTTCAACCGAGCTCCTGG 
CCACGGTGGCCGCCGAGTCCGAGGTGGGAAGCTCCGGCAACGGCCAGTCCGCCACCCGGGCG 
TTCGACCTCAACCTCCCGGCCGTGCCGGAGTTCGTGTGGCGGCCGTGCTCCAAGGGCAAGAA 
GATGTGGGACGAGGAGGAGGAGGTCCAGAGCCCCCTCGCCTTCAAGAAGCCCCGGCTTCTCA 
CCGCGTAATTCAGCAGCTGCACGGATCCGATCCGTCAGAGTTTTTGTCTAGGGAGTGAAATT 
CAGTCGAAACACACTATTCGTTGATTCGTTTTGTGCCGCTATTGTTTAATTTGTTCCTGCTT 

FIGURE 3 (continued) 



10/537897 



WO 2004/058980 PCT/EP2003/051104 



11/17 

TTGTACAGAGCAAGCGAGTGATACATAGCCATACATACAGTCATACAGATATAGGTCTAGCT 
CTTCCTTGGTTCTTTGTAACACTGGAACTGTACCTGTATCTTTTACACTTTGTTCTTTGACA 
GTCATATATTGTAGACCAAAAAAAAAAAAAAAAAA 

SEQ ID NO 37: gi_12698882_ref_AAK01713.1, 2xC2H2, Oryza sativa 
MS S AS SME ALHAAVLKEE QQQHE VEEAT WT S S S AT S GEE GGHL PQGWAKRKRS RRQRSEEE 
NIjyiiCLLMLARGGHHRVQAPPPLSASAPPPAGAEFKCSVCGKSFSSYQALGGHKTSHRVKLP 
TPPAAPVIiAPAPVAALLPSAEDREPATSSTAASSDGMTNRVHRCSICQKEPPTGQALGGHKR 
KHYDGGVGAGAGASSTELIATVAAESEVGSSGNGQSATRAFDLNLPAVPEFVWRPCSKGKKM 
WDEEEEVQSPLAFKKPRLLTA 

SEQ ID NO 38: gi_6434215_ref_AL132966 . l_region 116202 ... 
116729, 2xC2H2, Arabidopsis thaliana 

ATGAAGAGAGACCGGTCCGATTACGAAGAATCCATGAAGCATATAGACATAGTAGAAAGTCT 
AATGATGTTATCTCGAAGTTTCGTGGTCAAACAAATCGATGTAAAGCAATCTACCGGAAGCA 
AAACGAACCATAATAACCACTTCGAATGCAAAACGTGTAACCGGAAATTTGATTCCTTCCAA 
GCTCTTGGAGGTCATAGAGCTAGCCACAAGAAACCTAAGCTGATCGTTGACCAAGAACAGGT 
GAAGCATCGTAACAAAGAGAATGATATGCATAAGTGTACAATTTGCGATCAAATGTTTGGGA 
CCGGTCAAGCTCTAGGCGGTCACATGAGAAAGCATAGGACGAGCATGATAACCGAGCAATCG 
ATTGTCCCTTCTGTGGTTTATTCCAGACCGGTTTTTAATCGTTGCAGTAGCAGCAAGGAGAT 
CTTGGACTTAAATCTAACTCCATTGGAAAATGATCTTGTGTTAATCTTTGGGAAGAATTTGG 
TTCCACAAATTGATTTGAAGTTTGTGAATTAG 

SEQ ID NO 39: gi_6729511_ref_CAB67667 . 1, 2xC2H2, Arabidopsis 
thaliana 

MKRDRS DYEESMKHIDIVESLMMLSRS FWKQI DVKQSTGSKTNHNNHFECKTCNRKFDS FQ 
ALGGHRASHKKPKLIVDQEQVKHRNKENDMHKCTICDQMFGTGQALGGHMRKHRTSMITEQS 
IVPSWYSRPVFNRCSSSKEILDLNLTPLENDLVLIFGKNLVPQIDLKFVN 

SEQ ID NO 40: ref_CA279020, 2xC2H2, sugar cane 

CCTAACCAGCATTAGCTTTTCAAATCAACAAGCCTCGCCGTGACCGATCGATGGCCATCACC 
CACGACGACTACGTCTCCCTCTGCCTCATGGCGCTCGCAGCCGCGGGAGGCGGAGGCCAAGC 
TGGTTTAACAACGCAGTACGCTCTGAACACGGCTGCCTGGACAGCGACGGCGCAAGAGTCCG 
AGCTCCGCTTCCGGTGCTCCGTCTGTGGCAAGGCCTTCGCGTCGCACCAGGCACTGGGCGGG 
CACAAGGCCAGCCACCGCAAGCCGACGCTCGTACAGGCACATGCGTCGTCCTCAGCCGGAGG 
CGCGGCGTCGTCGTCGGTAACAATGACCTCGGCCGTAGGCAGCAGTGGGCAGGGGAGGCACA 
GGTGCACGGTGTGCCATCGGAGCTTCGCGACGGNGCAAGCGCTCGGCGGGCACAAGAGGTGC 
CATTACTGGGACGGGCTCTCGGTGTCGCTCACCGCGTCGTCGGCGCCATCGGGGTCCGGGTC 
GACCGTCAAGGGCTTTGATCTGAATTTGGTGCCGGTGCCGCCCGCGATGGCCGCCAACGCTG 
CGACAAGGTGGGGAGAGGAGAANNAAGTCANAAACCCTTGGCGGTCAAGAGAAGGCGGCTTG 
CCGGTCCGTCTTGGACCCTAATTTAACGATTTAGAAGTCCTTTTTTTAATAATTAAGAGTTC 
TTTTGAAGAAGGTTGTAAAGTTTTCGAACCTTGTTCTTTTAATGGATTTGGGTGCTGGCGAA 
ATTTTAAAACTGGATTTAAATTTGCGCTCACTCTTTTTTTTTATTTTTTACACCCTTTTTTT 
TTTTTAGAAGAAGA 
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SEQ ID NO 41: gi_18027011_ref_AF254447 .1, 2xC2H2, Arabidopsis 
thaliana 

TTCCTTTCTCTTCCTCTCTCTCTCTCTTCACCATGACTGATCCTTATTCCAATTTCTTCACA 
GACTGGTTCAAGTCTAATCCTTTTCACCATTACCCTAATTCCTCCACTAACCCCTCTCCTCA 
TCCTCTTCCTCCTGTTACTCCTCCCTCTTCCTTCTTCTTCTTCCCTCAATCCGGAGACCTCC 
GCCGTCCACCGCCGCCACCAACTCCTCCTCCTTCTCCTCCTCTCCGAGAAGCCCTCCCTCTC 
CTCAGCCTCAGCCCCGCCAACAAACAACAAGACCACCATCACAACCATGACCACCTTATTCA 
AGAACCACCTTCAACCTCCATGGATGTCGACTACGATCATCACCATCAAGATGATCATCATA 
ACCTCGATGACGATGACCATGACGTCACCGTTGCTCTTCACATAGGCCTTCCAAGCCCTAGT 
GCTCAAGAGATGGCCTCTTTGCTCATGATGTCTTCTTCTTCCTCTTCCTCGAGGACCACTCA 
TCATCACGAGGACATGAATCACAAGAAAGACCTCGACCATGAGTACAGCCACGGAGCTGTCG 
GAGGAGGAGAAGATGACGATGAAGATTCAGTCGGCGGAGACGGCGGCTGTAGAATCAGCAGA 
CTCAACAAGGGTCAATATTGGATCCCTACACCTTCTCAGATTCTCATTGGCCCTACTCAGTT 
CTCATGTCCTGTTTGCTTCAAAACCTTCAACAGATACAATAACATGCAGATGCATATGTGGG 
GACATGGATCACAATACAGAAAAGGACCTGAATCTCTAAGGGGAACACAACCAACAGGAATG 
CTAAGGCTTCCGTGCTATTGCTGCGCCCCAGGCTGTCGCAACAACATTGACCATCCAAGGGC 
AAAGCCTCTCAAAGACTTCAGAACCCTTCAAACACATTACAAGAGAAAACATGGGATCAAAC 
CTTTCATGTGTAGGAAATGTGGAAAGGCTTTCGCAGTCCGAGGGGACTGGAGAACACATGAG 
AAGAATTGTGGCAAACTTTGGTATTGCATATGTGGATCTGATTTCAAGCACAAGAGATCTCT 
CAAAGATCACATCAAGGCTTTTGGGAATGGTCATGGAGCCTACGGAATTGATGGGTTTGATG 
AAGAAGATGAGCCTGCCTCTGAGGTAGAACAATTAGACAATGATCATGAGTCAATGCAGTCT 
AAAT AGCT T AT AT AT AT T AC TAT AAGT AC T AAG T AAT TC GGT AT AT AT AT TAAT T ATAAG AA 
ACCTAAATCTATGGACCAAGTTTTGATGGAGGTAGGGCTTTTCAAACTAAAAGCTATATCAT 
CTAATTGATCATAGGAAAAAAATGAATCAAGAGCACTTGGAAAATTTTAAATTGTATCTTTA 
GCTTCCTAGTTAAATTTATTGCAAGACAATGTAGCAGTCTAACCAATGAGGTTCCCAACGGT 
TTATTTCTATTTGTATATTATTTTGTCATTAGCTTCACCTTTCGTTAATTCGAAGGACATAA 
CTTATAAATGTTTAAATTATG 

SEQ ID NO 42: At3g57670, 2xC2H2, Arabidopsis thaliana 

MTDPYSNFFTDWFKSNPFHHYPNSSTNPSPHPLPPVTPPSSFFFFPQSGDLRRPPPPPTPPP 

SPPLREALPLLSLSPANKQQDHHHNHDHLIQEPPSTSMDVDYDHHHQDDHHNLDDDDHDVTV 

ALHIGLPSPSAQEMASLLMMSSSSSSSRTTHHHEDMNHKKDLDHEYSHGAVGGGEDDDEDSV 

GGDGGCRISRLNKGQYWIPTPSQILIGPTQFSCPVCFKTFNRYNNMQMHMWGHGSQYRKGPE 

SLRGTQPTGMLRLPCYCCAPGCRNNIDHPRAKPiKDFRTLQTHYKRKHGIKPFMCRKCGKAF 

AVRGDWRTHEKNCGKLWYCICGSDFKHKRSLKDHIKAFGNGHGAYGIDGFDEEDEPASEVEQ 

LDNDHESMQSK 

SEQ ID NO 43: gi_18676370_ref_AJ311810 .2, 2xC2H2, Arabidopsis 
thaliana 

ATCTACACACTACTACTCACATCTCATCTCTCTCTAGCACATACCCATCAAACCATATAGAT 
ACGGTGCTTTTATTCTTGATCTTCTTCTTCTTCTTTGTCTTCTCCTCAGAGTCATGTCTAAT 
CCAGCTTGTTCGAATCTCTTCAACAATGGATGTGACCATAATAGCTTCAACTATTCCACTTC 
TCTCTCTTACATTTACAACTCTCACGGTAGCTACTATTACTCTAATACCACAAACCCTAATT 
ACATTAATCATACTCATACCACTTCCACTTCCCCTAACTCACCCCCACTAAGAGAAGCTCTT 
CCTCTTCTTAGCTTAAGCCCCATAAGGCACCAAGAACAACAAGACCAACACTATTTCATGGA 
CACCCATCAAATTAGCTCTTCAAACTTTCTTGATGATCCTCTTGTGACTGTGGATCTTCATC 
TAGGGTTACCAAACTACGGTGTTGGTGAGAGCATTAGGAGCAATATTGCTCCTGATGCAACC 
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ACGGACGAGCAAGATCAAGATCATGACCGAGGAGTAGAAGTCACAGTTGAGTCCCACCTTGA 
TGATGATGATGATCATCATGGAGATCTACACAGAGGTCATCACTATTGGATTCCTACTCCTT 
CTCAGATTTTGATTGGTCCTACACAGTTCACTTGTCCTCTTTGCTTCAAGACATTCAACAGA 
TACAACAACATGCAGATGCACATGTGGGGACACGGCTCACAATACAGAAAGGGACCAGAATC 
CTTAA.GAGGAA.CCCAACCAACAGGAATGCTAAGACTACCATGTTTCTGCTGTGCACCCGGTT 
GCAAGAACAACATTGACCACCCACGAGCCAAGCCTCTTAAGGACTTTCGAACCCTCCAAACA 
CATTACAAACGTAAACATGGGTCTAAACCATTTGCTTGTCGTATGTGTGGTAAGGCCTTTGC 
AGTGAAA.GGAGATTGGAGAACGCATGAGAAGAA.TTGTGGAAAGCTTTGGTATTGCTCTTGTG 
GCTCGGATTTTAAGCACAAGAGGTCGCTTAAGGACCATGTCAAGGCCTTTGGAAATGGTCAT 
GTTCCTTGTGGGATTGATAGTTTTGGAGGAGATCATGAGGACTACTATGATGCTGCTTCTGA 
TATCGAGCAATAAGATGATAGCAACAACAATGAGTGTTAATTAGGGGTTTTGTTTATTTTTC 
CTCTCATGCATTAGTTGATTGTATGCACGTGTTCTTTAGTTTTGTTCTTCGGATCTTTGTTT 
TATTTTGTTTTGAGCTGTTTTTTTTTTAATTACTAAGAAGTTAATTATCATCTAAA.GATTTT 

C 

SKQ ID NO 44: gi_18376498_ref_CAC86167 . 1 , 2xC2H2, Arabidopsis 
thaliana 

MSNPACSNLFNNGCDHNSFNYSTSLSYIYNSHGSYYYSNTTNPNYINHTHTTSTSPNSPPLR 
EALPLLSLSPIRHQEQQDQHYFMDTHQISSSNFLDDPLVTVDLHLGLPNYGVGESIRSNIAP 
DATTDEQDQDHDRGVEVTVES HLDDDDDHHGDLHRGHHYWI PTP SQ ILIGPTQFT CPLCFKT 
FNRYNNMQMHMWGHGSQYRKGPESLRGTQPTGMLRLPCFCCAPGCKNNIDHPRAKPLKDFRT 
LQTH YKRKHG S KP FACRMCGKAF AVKG DWRT HEKNC GKLW YC S C GS DFKHKRS LKDHVKAFG 
NGHVPC G I DS FGG DHE D Y Y DAAS D I EQ 

SEQ ID NO 45: gi_7798991_ref_AL355775.1_region 7957 ... 8451, 
2xC2H2, Arabidopsis thaliana 

ATGGTTGCGAGAAGTGAGGAAGTTGAGATAGTGGAAGATACGGCGGCGAAATGTTTGATGTT 
GTTATCAAGAGTTGGAGAATGCGGCGGAGGAGGAGAGAAACGAGTTTTCCGATGCAAGACTT 
GTCTTAAAGAGTTTTCGTCGTTTCAAGCTTTGGGAGGTCATCGTGCAAGCCACAAGAAACTC 
ATTAACAGTAGCGATCCATCACTTCTTGGATCCTTGTCTAACAAGAAAACTAAAACGGCGAC 
GTCTCATCCTTGTCCGATATGTGGCGTGGAGTTTCCGATGGGGCAAGCTCTTGGTGGTCACA 
TGAGGAGACATAGGAGTGAGAAAGCCTCACCAGGCACGTTGGTTACACGTTCTTTTTTACCG 
GAGACGACGACGGTGACGACTTTGAAAAAATCGAGTAGTGGGAAGAGAGTGGCTTGTTTGGA 
CTTAGATTCGATGGAGAGTTTAGTCAATTGGAAGTTGGAGTTGGGAAGAACGATTTCTTGA 

SEQ ID NO 46: gi_7798996_ref_CAB90935 . 1, 2xC2H2, Arabidopsis 
thaliana 

MVARSEEVEIVEDTAAKCLMLLSRVGECGGGGEKRVFRCKTCIiKEFSSFQALGGHRASHKKL 
INSSDPSLLGSLSNKKTKTATSHPCPICGVEFPMGQALGGHMRRHRSEKASPGTLVTRSFLP 
ET TTVT TLKKS S S GKRVACLDLD SMES LVNWKLE LGRT I S 

SEQ ID NO 47: gi_9755794_ref_AL391143 . 1 _region 31730 ... 
32938, 2xC2H2, Arabidopsis thaliana 

ATGGAAGACGAACATCAAGATCTCCATAAACCCATTAATGGAGCTTTGCGAGACCTCAAGAT 
TACTCGGTCACAGAAAGAAA.CAGAAAAGTCTACGAACCAACAGCAAGATGTTACTTGTTACT 
ATGGTCTAAGGGAAAACTCGAAGAAGAAAACCCAGGAATCTCCGGAACCAATGAAGAAGATT 
TTGTTTCGATGCGAAGAATGTGGAAAAGGGTTTCGGTACGAGAAATATTTTAAGAATCATCG 
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CTCGATGATGCATTTATCGCCGAACGAGAAGGTTTGTGAAGAATCCTTGATGACTCTGTCTC 
GTAGCCTTGGGTTTGTGAAGAAGAAGAAAAGATCAAGACTTGGTAGGTCTGGGAAGACTTTA 
TTTACTACGTTTCTTGAACCGAGTTCTATTTTTGATGCGACTGATGAAGAATTAGAAGTGGC 
GGATTGTTTGATTCTATTGTCTAAGAGTGCTCCCAAGGTTGTAGACGAATTGAAAAGTCTTT 
CTGAGGCAGTACGTGTTACTCCTGAAACACCTGAAAGTAGCTATGATTTGGGTTGTTTGCTC 
AACAAGAAACCGAGAAAAGGTGGTGAATTGGAATCTGGGGTTTTAAGTAATGAGCAAAGACT 
TATGGAAGAAGGGTTTAGTAGTTATGGAACATCGAAAGAACCAGCTAGCTTCTTGAGAGACG 
AAAACAGATTGGATCAGCAGAAACGGAGAAAAGATGGTGAATTTGAATCCGGACTTTTGAGT 
AATGAGCAAAGACTGCTAGAAGAAGAGATTACTACTCCTGTGACATTCAAAGGTCCAGCGAG 
TTCCTTGAGACACAAGTGTGCTTTGGATCGAAATGGAGGTGAATTTGGTCCTGAGTTTTTGA 
GTAATGAGCAAACACTGATGGAAGAAACATGGAAAGAACCAGTGAGTTTCTTAGAAGATAAG 
CATGAATTTGATCAGCGGAAAATGCGAGAAGCTGGCGACTTTGAATCTAGGTTTTACAGAAT 
TGAGCTTGGAGTAGGAGCTATGGAGTGTACTTCTTCAGATACTGATATGCTCACGCAATCTG 
ATAAGAAGAACGTTGAGCATCGATGCAGGTTGTGCAACAAGATATTCTCGTCTTATCAAGCT 
CTAGGGGGTCATCAGACGTTTCATCGGATGAGCAAATGTAAGAACAAGAAGAATGGCATAGA 
GGAATCAGTTGAACCCAGGATGACTCTGTGA 

SEQ ID NO 48: gi_9755803_ref_CAC01747.1, 2xC2H2, Arabidopsis 

^DEHQDLHKPINGALRDIJCITRSQKETEKSTNQQQDVTCYYGLKENSKKKTQESPEPMKKI 
LFRCEECGKGFRYEKYFKNHRSMMHLSPNEKVCEESLMTLSRSLGFVKKKKRSRLGRSGKTL 
FTTFLEPSSIFDATDEELEVADCLILLSKSAPKWDELKSLSEAVRVTPETPESSYDLGCLL 
NKKPRKGGELESGVLSNEQRLMEEGFSSYGTSKEPASFLRDENRLDQQKRRKDGEFESGLLS 
NEQRLLEEEITTPVTFKGPASSLRHKCALDRNGGEFGPEFLSNEQTLMEETWKEPVSFLEDK 
HEFDQRKMRE AGDFESRFYRIELGVGAMECT SSDT DMIiTQSDKKNVEHRCRLCNKI FS SYQA 
LGGHQTFHRMSKCKNKKNGIEESVEPRMTL 

SEQ ID NO 49: gi_1418338_ref_X98678 . 1, 2xC2H2, Arabidopsis 

CTTGTTAGTTCACTCCACATAATAAACACCAAAGATTTCATTCTCTTCTCCATAATTTCGAA 

TACTATATTTGGATATGATGATGGGTCAAGATGAGGTTGGGAGTGATCAGACGCAAATCATA 
AAAGGGAAACGTACGAAGCGACAAAGATCGTCTTCGACGTTTGTGGTGACGGCGGCGACAAC 
AGTGACTTCAACAAGTTCATCGGCCGGTGGAAGTGGAGGAGAAAGAGCTGTTTCAGATGAAT 

ATTTGTCTCATCATGTTAGCTCGTGGGACAGTTCTTCCATCGCCGGATCTCAAGAACTCGAG 
AAAAATTCATCAGAAGATTTCGTCGGAGAATTCTAGTTTCTATGTGTACGAGTGTAAAACGT 
GTAACCGGACGTTTTCGTCGTTCCAAGCACTTGGTGGACACAGAGCGAGCCACAAGAAGCCG 
AGGACGTCGACTGAGGAAAAGACTAGACTACCCCTGACGCAACCCAAGTCTAGTGCATCAGA 
AGAAGGGCAAAACAGTCATTTCAAAGTTTCCGGCTCAGCCCTAGCTTCACAGGCAAGTAACA 
TCATCAACAAGGCAAACAAAGTACACGAGTGTTCCATCTGCGGTTCTGAGTTCACTTCCGGG 
CAAGCTCTCGGTGGTCACATGAGGCGGCACAGGACAGCCGTAACCACGATTAGCCCCGTTGC 
AGCCACCGCAGAAGTAAGCAGAAACAGTACAGAGGAAGAGATTGAGATCAATATAGGCCGTT 
CGATGGAACAGCAGAGGAAATATCTACCGTTGGATCTTAATCTACCAGCACCAGGAGATGAT 
CTAAGAGAGTCCAAGTTTCAAGGGATAGTATTCTCAGCAACACCAGCGTTAATAGATTGTCA 
TTACTAGTTGTTTTTTTTACTACATAATATGATGAAATATTTGTGAATTCTTCTTACTTACT 
ACTATATTGTTGATCAAAAAAAAAAAAAAAAAA 
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SEQ ID NO 50: gi_1418339_ref_CAA67236. 1, 2xC2H2, Arabidopsis 
thaliana 

MGQDEVGSDQTQIIKGKRTKRQRSSSTFWTAATTVTSTSSSAGGSGGERAVSDEYNSAVSS 
PVTTDCTQEEEDMAICLIMLARGTVLPSPDLKNSRKIHQKISSENSSFYVYECKTCNRTFSS 
FQALGGHRASHKKPRTSTEEKTRLPLTQPKSSASEEGQNSHFKVSGSALASQASNIINKANK 
VHECSICGSEFTSGQALGGHMRRHRTAVTTISPVAATAEVSRNSTEEEIEINIGRSMEQQRK 
YLPLDLNLPAPGDDLRESKFQGIVFSATPALIDCHY 
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